Table of Contents
HBase
Return to Misconfigured HBase,Databases, DB-Engines.com GitHub star Ranking for Organizations, GitHub Star Ranking for Repositories, GitOps or GitHub
TLDR: HBase, introduced in 2008, is a distributed, open-source NoSQL database designed for real-time processing of large-scale structured and semi-structured data. Built on top of Hadoop, it provides low-latency access to vast amounts of key-value and column-oriented data types. HBase excels in scenarios requiring high write throughput and random read access, making it ideal for real-time data analytics, IoT workloads, and large-scale transaction processing in industries such as telecommunications and e-commerce.
HBase integrates seamlessly with the Hadoop ecosystem, offering advanced features like automatic sharding, replication, and failover. It supports the HBase Shell as its query language and integrates with Apache Spark and Apache Hive for advanced analytics and batch processing. Its compatibility with popular programming terms such as Java, Python, and Scala enables developers to build robust and scalable applications. Widely used in hybrid and cloud database environments, HBase ensures reliable performance and scalability for enterprise-grade data science and operational analytics.
https://github.com/apache/hbase
https://en.wikipedia.org/wiki/Apache_HBase
- Snippet from Wikipedia: Apache HBase
HBase is an open-source non-relational distributed database modeled after Google's Bigtable and written in Java. It is developed as part of Apache Software Foundation's Apache Hadoop project and runs on top of HDFS (Hadoop Distributed File System) or Alluxio, providing Bigtable-like capabilities for Hadoop. That is, it provides a fault-tolerant way of storing large quantities of sparse data (small amounts of information caught within a large collection of empty or unimportant data, such as finding the 50 largest items in a group of 2 billion records, or finding the non-zero items representing less than 0.1% of a huge collection).
HBase features compression, in-memory operation, and Bloom filters on a per-column basis as outlined in the original Bigtable paper. Tables in HBase can serve as the input and output for MapReduce jobs run in Hadoop, and may be accessed through the Java API but also through REST, Avro or Thrift gateway APIs. HBase is a wide-column store and has been widely adopted because of its lineage with Hadoop and HDFS. HBase runs on top of HDFS and is well-suited for fast read and write operations on large datasets with high throughput and low input/output latency.
HBase is not a direct replacement for a classic SQL database, however Apache Phoenix project provides a SQL layer for HBase as well as JDBC driver that can be integrated with various analytics and business intelligence applications. The Apache Trafodion project provides a SQL query engine with ODBC and JDBC drivers and distributed ACID transaction protection across multiple statements, tables and rows that use HBase as a storage engine.
HBase is now serving several data-driven websites but Facebook's Messaging Platform migrated from HBase to MyRocks in 2018. Unlike relational and traditional databases, HBase does not support SQL scripting; instead the equivalent is written in Java, employing similarity with a MapReduce application.
In the parlance of Eric Brewer's CAP Theorem, HBase is a CP type system.
YouTube Videos
Cloud Monk recommends the following YouTube video:
GitHub Tags
External Sites
Main
- hbase
Interesting Articles
Support Resources, FAQs, Q&A, Docs, Blogs
Search Engines
Repos and Registries
Package Managers Artifact Registries and Repos:
- https://hub.docker.com/search?q=hbase &type=image - Docker Hub Search
Courses
Books
Vidcasts-Podcasts
Database: Databases on Kubernetes, Databases on Containers / Databases on Docker, Cloud Databases (DBaaS). Database Features, Concurrent Programming and Databases, Functional Concurrent Programming and Databases, Async Programming and Databases, Database Security, Database Products (MySQL, Oracle Database, Microsoft SQL Server, MongoDB, PostgreSQL, SQLite, Amazon RDS, IBM Db2, MariaDB, Redis, Cassandra, Amazon Aurora, Microsoft Azure SQL Database, Neo4j, Google Cloud SQL, Firebase Realtime Database, Apache HBase, Amazon DynamoDB, Couchbase Server, Elasticsearch, Teradata Database, Memcached, Amazon Redshift, SQLite, CouchDB, Apache Kafka, IBM Informix, SAP HANA, RethinkDB, InfluxDB, MarkLogic, ArangoDB, RavenDB, VoltDB, Apache Derby, Cosmos DB, Hive, Apache Flink, Google Bigtable, Hadoop, HP Vertica, Alibaba Cloud Table Store, InterSystems Caché, Greenplum, Apache Ignite, FoundationDB, Amazon Neptune, FaunaDB, QuestDB, Presto, TiDB, NuoDB, ScyllaDB, Percona Server for MySQL, Apache Phoenix, EventStoreDB, SingleStore, Aerospike, MonetDB, Google Cloud Spanner, SQream, GridDB, MaxDB, RocksDB, TiKV, Oracle NoSQL Database, Google Firestore, Druid, SAP IQ, Yellowbrick Data, InterSystems IRIS, InterBase, Kudu, eXtremeDB, OmniSci, Altibase, Google Cloud Bigtable, Amazon QLDB, Hypertable, ApsaraDB for Redis, Pivotal Greenplum, MapR Database, Informatica, Microsoft Access, Tarantool, Blazegraph, NeoDatis, FileMaker, ArangoDB, RavenDB, AllegroGraph, Alibaba Cloud ApsaraDB for PolarDB, DuckDB, Starcounter, EventStore, ObjectDB, Alibaba Cloud AnalyticDB for PostgreSQL, Akumuli, Google Cloud Datastore, Skytable, NCache, FaunaDB, OpenEdge, Amazon DocumentDB, HyperGraphDB, Citus Data, Objectivity/DB). Database drivers (JDBC, ODBC), ORM (Hibernate, Microsoft Entity Framework), SQL Operators and Functions, Database IDEs (JetBrains DataSpell, SQL Server Management Studio, MySQL Workbench, Oracle SQL Developer, SQLiteStudio), Database keywords, SQL (SQL keywords - (navbar_sql), Relational databases, DB ranking, Database topics, Data science (navbar_datascience), Apache CouchDB, Oracle Database (navbar_oracledb), MySQL (navbar_mysql), SQL Server (T-SQL - Transact-SQL, navbar_sqlserver), PostgreSQL (navbar_postgresql), MongoDB (navbar_mongodb), Redis, IBM Db2 (navbar_db2), Elasticsearch, Cassandra (navbar_cassandra), Splunk (navbar_splunk), Azure SQL Database, Azure Cosmos DB (navbar_azuredb), Hive, Amazon DynamoDB (navbar_amazondb), Snowflake, Neo4j, Google BigQuery, Google BigTable (navbar_googledb), HBase, ScyllaDB, DuckDB, SQLite, Database Bibliography, Manning Data Science Series, Database Awesome list (navbar_database - see also navbar_datascience, navbar_data_engineering, navbar_cloud_databases, navbar_aws_databases, navbar_azure_databases, navbar_gcp_databases, navbar_ibm_cloud_databases, navbar_oracle_cloud_databases, navbar_scylladb)
Database Navbar
Database | Database management system:
Related Topics:
Category:Database_management_systems | Category
Cloud Monk is Retired ( for now). Buddha with you. © 2025 and Beginningless Time - Present Moment - Three Times: The Buddhas or Fair Use. Disclaimers
SYI LU SENG E MU CHYWE YE. NAN. WEI LA YE. WEI LA YE. SA WA HE.