A SURVEY PAPER ON ELASTIC SEARCH SIMILARITY ALGORITHM

Authors

  • Nilanjana Dev Nath School of Computing Science and Engineering, VIT University, Chennai Campus, Tamil Nadu, India
  • Shreekant Jha Intel Technologies Pvt. Ltd., Bengaluru, Karnataka, India.
  • Janki Meena M School of Computing Science and Engineering, VIT University, Chennai Campus, Tamil Nadu, India
  • Syedibrahim S.p School of Computing Science and Engineering, VIT University, Chennai Campus, Tamil Nadu, India

DOI:

https://doi.org/10.22159/ajpcr.2017.v10s1.19757

Keywords:

Elasticsearch, Lucene, BigData, Ranking Algorithm, Indexing, Mapping, Scoring

Abstract

Elasticsearch is a web search tool in view of Lucene. Apache Lucene is a free and open-source data retrieval programming library. Versatile Search gives a conveyed, multitenant-fit full-content web search tool with a HTTP web interface and pattern free JSON archives. It is created in Java and has been released as open source under the terms of the Apache License. Elasticsearch can be utilized to pursuit a wide range of records. It gives adaptable hunt, has close continuous pursuit, and backings multitenancy. It is appropriated, which implies that records can be partitioned into shards and every shard can have zero or more duplicates. Every hub has one or more shards, and goes about as a facilitator to delegate operations to the right shard(s). Elasticsearch is like a wrapper on top of Lucene. In this paper a detailed description of how lucene's scoring algorithm works and how elasticsearch uses it as similarity algorithmâ€

Downloads

Download data is not yet available.

References

Kononenko O. Mining Modern Repositories with Elasticsearch. Hyderabad, India: MSR; 2014.

Sematext. Elasticsearch Refresh Interval vs. Indexing Performance. Available from: http://www.bit.ly/1iZoPGc.

Divya MS, Goyal SK. Elastic search: An advanced and quick search technique to handle voluminous data. COMPUSOFT Int J Adv Comput Technol 2013;2(6):171-5.

Long B, Chapelle JB, Zhang Y. Ranking through expected loss optimization. IEEE Trans Knowl Data Eng 2015;27(5):267-74.

Lin J, Ryaboy D, Weil K. Full-text indexing for optimizing selection operations in large-scale data analytics. San Jose, California, New York, USA: ACM; 2011.Available from: http://www.acm.org/publications.

Unpluggable Similarity. Available from: https://www.elastic.co/guide/en/ elasticsearch/guide/current/pluggablesimilarites.htm.

Butler MH, Rutherford J. Distributed Lucene: A Distributed Free Text Index for Hadoop. ???: HP Laboratories, HPL; 2008.

Published

01-04-2017

How to Cite

Nath, N. D., S. Jha, J. M. M, and S. S.p. “A SURVEY PAPER ON ELASTIC SEARCH SIMILARITY ALGORITHM”. Asian Journal of Pharmaceutical and Clinical Research, vol. 10, no. 13, Apr. 2017, pp. 361-4, doi:10.22159/ajpcr.2017.v10s1.19757.

Issue

Section

Original Article(s)