Sunday 20 September 2015

Elasticsearch

Elasticsearch is an open-source, distributed search and analytics engine built on top of Apache Lucene. It can work alongside your Hadoop cluster and can handle petabytes of data.

Why Elasticsearch?
Elasticsearch makes searching and analyzing your data much easier.

Why do we need a search engine?
Today we are overwhelmed by huge amounts of data. Traditional approaches such as an RDBMS are a good fit for storing data and running transactions, but not for full-text search.
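
To make the full-text search point concrete, here is a minimal sketch of an inverted index, the kind of data structure Lucene-based engines rely on. It is only an illustration: the function names (tokenize, build_inverted_index, search) and the sample documents are made up for this post, and the whitespace tokenizer is far simpler than a real analyzer.

```python
from collections import defaultdict


def tokenize(text):
    # Naive analyzer (an assumption for this sketch): lowercase, split on whitespace.
    return text.lower().split()


def build_inverted_index(docs):
    # Map each term to the set of document ids that contain it.
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in tokenize(text):
            index[term].add(doc_id)
    return index


def search(index, query):
    # Return the ids of documents that contain every term in the query.
    terms = tokenize(query)
    if not terms:
        return set()
    result = set(index.get(terms[0], set()))
    for term in terms[1:]:
        result &= index.get(term, set())
    return result


# Hypothetical sample documents.
docs = {
    1: "Elasticsearch is built on Apache Lucene",
    2: "Relational databases store rows in tables",
    3: "Lucene powers full text search",
}

index = build_inverted_index(docs)
print(search(index, "lucene"))
print(search(index, "full text"))
```

A lookup like this goes straight from a term to the matching documents, whereas a query such as LIKE '%lucene%' in an RDBMS typically has to scan the text of every row.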

Are these traditional approaches really capable of analyzing all of your company's historical data (assume you have petabytes of data, in both structured and unstructured formats)?
The answer is no. Let's see why a traditional RDBMS is not the best fit:
1.   An RDBMS follows a schema-based approach. The data you store in an RDBMS must have a fixed format. But in the real world, a large share of data is unstructured and has no fixed schema, for example Facebook posts, text-mining data, and social media content used for sentiment analysis.
2.   A traditional RDBMS is not easy to scale horizontally. Horizontal scalability is the ability to increase capacity by connecting multiple hardware or software entities so that they work as a single logical unit.
3.   A traditional RDBMS is not designed to work in a distributed environment.
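
The points above can be sketched in a few lines of Python. This is only a toy model, not Elasticsearch's actual routing code: the documents, NUM_NODES, node_for and distribute are all hypothetical names chosen for illustration. It shows documents with different fields (no fixed schema) being spread across several nodes by hashing their ids, so that the nodes together behave like one logical store.

```python
import zlib

NUM_NODES = 3  # hypothetical cluster size


def node_for(doc_id, num_nodes=NUM_NODES):
    # A deterministic hash of the document id picks the node that stores it.
    return zlib.crc32(doc_id.encode("utf-8")) % num_nodes


def distribute(docs, num_nodes=NUM_NODES):
    # Place every document on the node chosen by its id.
    nodes = {n: [] for n in range(num_nodes)}
    for doc_id, doc in docs.items():
        nodes[node_for(doc_id, num_nodes)].append((doc_id, doc))
    return nodes


# Schema-less documents: each one carries its own set of fields.
docs = {
    "post-1": {"user": "alice", "text": "Loving the new release", "likes": 12},
    "tweet-7": {"handle": "@bob", "body": "search is slow today", "hashtags": ["db"]},
    "review-3": {"product": "laptop", "rating": 4, "comment": "solid build"},
}

nodes = distribute(docs)
for node, stored in nodes.items():
    print(node, [doc_id for doc_id, _ in stored])
```

Capacity in this model comes from adding nodes. Note that with plain modulo hashing, changing the node count moves most documents to a different node, so real systems use more careful placement schemes.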


