Elastic
search is a open-source distributed analytics search engine, built on top of
Apache Lucene. Elastic search can run on your Hadoop cluster, and can handle
petabytes of data.
Why Elastic Search?
Elastic
search makes your searching and analyzing process very easy.
Why we need a search engine?
Today we are
overwhelmed by huge amount of data. All traditional approaches are best fit to
store data (Like RDBMS) and run transactions, not for full text search.
Are these traditional approaches really capable of
analyzing all your company historical data (Assume you had peta bytes of data,
both in structured and unstructured formats)?
Answer is
no…well we will see why our traditional RDBMS is not best fit..
1.
RDBMS
follows schema approach. The data you store into RDBMS must has fixed format.
But in real time, almost much amount of data is unstructured; there is no fixed
schema for that, like facebook posts, text mining data, sentiment analysis on
social media etc.
2.
RDBMS
is not horizontally scalable. Horizontal scalability is the ability to increase
capacity by connecting multiple hardware or software entities so that they work
as a single logical unit.
3. RDBMS can’t work in distributed environment.
3. RDBMS can’t work in distributed environment.
No comments:
Post a Comment