Indexing HBase, one row at a time
HBase Indexer is an open source project for indexing HBase rows into Solr. It provides a flexible and extensible way of defining indexing rules, and is designed to work at scale.
Lily HBase Indexer provides the ability to quickly and easily search for any content stored in HBase. It allows you to easily index HBase rows into Solr, without writing a line of code. It piggybacks on the HBase replication mechanism and provides a simple, configuration-based and extensible indexing engine to feed row updates into Solr document updates.
It doesn’t require Lily, but originates from years of experience indexing HBase as part of Lily Enterprise. Indexing is performed asynchronously, so it does not impact write throughput even with peak loads.
The SEP trigger notification mechanism is used inside Lily as well for forwarding DNA profile updates to other system components.
- Apache HBase replication
- Apache Solr
- Cloudera Morphlines
Amongst others, it is used in Lily Enterprise to drive UI search, and Cloudera Search for HBase support. HBase Indexer has been designed and implemented by NGDATA and is now collaborated upon by Cloudera.