Apache Accumulo 4.0 Feature Preview
Apache Accumulo is a scalable distributed key/value store using HDFS and ZooKeeper, featuring server-side programming, cell-based access control, and dynamic node scaling. Latest versions are 3.0.0 and 2.1.3.
Apache Accumulo is a distributed key/value store designed for robust and scalable data storage and retrieval. It uses Apache Hadoop's HDFS for data storage and Apache ZooKeeper for consensus. Accumulo lets users manage large datasets across a cluster and supports server-side programming through a mechanism called Iterators, which can modify key/value pairs at various points in the data management process. It features cell-based access control: each key/value pair carries a security label, and query results are restricted to the pairs whose labels are satisfied by the user's authorizations. The system is designed to scale, so nodes can be added or removed as data storage needs change. Accumulo maintains a stable client API, follows semantic versioning, and designates long-term maintenance (LTM) releases, with each release undergoing extensive testing. The latest versions are Accumulo 3.0.0 and 2.1.3, with a preview of version 4.0 expected in October 2024.
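To make the Iterator idea more concrete, here is a minimal sketch in Python of a server-side pipeline that transforms sorted key/value pairs before they reach the client. It is only an illustration of the concept: real Accumulo iterators are Java classes, and the names `summing_combiner`, `value_filter`, and `run_scan` are hypothetical helpers invented for this example, not part of any Accumulo API.

```python
from itertools import groupby

def summing_combiner(sorted_pairs):
    """Collapse consecutive entries that share a key into one entry holding their sum."""
    for key, group in groupby(sorted_pairs, key=lambda kv: kv[0]):
        yield key, sum(value for _, value in group)

def value_filter(pairs, min_value):
    """Pass through only the entries whose value is at least min_value."""
    for key, value in pairs:
        if value >= min_value:
            yield key, value

def run_scan(table, *stages):
    """Hypothetical scan: sort the entries, then apply each stage in order."""
    stream = iter(sorted(table))
    for stage in stages:
        stream = stage(stream)
    return list(stream)

# Assumed sample data: (key, numeric value) pairs in arbitrary order.
table = [("b", 2), ("a", 1), ("a", 4), ("c", 1)]

print(run_scan(table, summing_combiner))
# [('a', 5), ('b', 2), ('c', 1)]

print(run_scan(table, summing_combiner, lambda s: value_filter(s, 2)))
# [('a', 5), ('b', 2)]
```

The point of running such stages on the server is that the client receives already aggregated or filtered results instead of the raw entries.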
- Apache Accumulo is a scalable, distributed key/value store.
- It uses HDFS for storage and ZooKeeper for consensus.
- Features include server-side programming and cell-based access control.
- Accumulo supports dynamic scaling of nodes based on data needs.
- The latest releases include versions 3.0.0 and 2.1.3, with a 4.0 preview coming soon.
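The cell-based access control mentioned above can be sketched as a small label evaluator. This is a simplified model in Python, not Accumulo's implementation: it assumes labels are built from authorization tokens joined with `&` (and), `|` (or), and parentheses, evaluates operators left to right, and treats an empty label as visible to everyone. The functions `tokenize`, `is_visible`, and `scan_visible` are hypothetical names used only for this example.

```python
def tokenize(label):
    """Split a label such as '(admin&audit)|analyst' into operator and word tokens."""
    tokens, word = [], ""
    for ch in label:
        if ch in "&|()":
            if word:
                tokens.append(word)
                word = ""
            tokens.append(ch)
        elif not ch.isspace():
            word += ch
    if word:
        tokens.append(word)
    return tokens

def is_visible(label, auths):
    """Return True if the set of authorizations satisfies the label expression."""
    if not label:
        return True  # assumption: an unlabeled cell is visible to all users
    tokens = tokenize(label)
    pos = 0

    def parse_term():
        nonlocal pos
        tok = tokens[pos]
        pos += 1
        if tok == "(":
            result = parse_expr()
            pos += 1  # skip the closing ")"
            return result
        return tok in auths

    def parse_expr():
        nonlocal pos
        result = parse_term()
        while pos < len(tokens) and tokens[pos] in ("&", "|"):
            op = tokens[pos]
            pos += 1
            rhs = parse_term()
            result = (result and rhs) if op == "&" else (result or rhs)
        return result

    return parse_expr()

def scan_visible(cells, auths):
    """Return only the (row, value) pairs whose label the user may see."""
    return [(row, value) for row, value, label in cells if is_visible(label, auths)]

# Assumed sample cells: (row, value, security label).
cells = [
    ("row1", "public", ""),
    ("row2", "secret", "admin&audit"),
    ("row3", "shared", "(admin&audit)|analyst"),
]

print(scan_visible(cells, {"analyst"}))
# [('row1', 'public'), ('row3', 'shared')]
```

Two users running the same scan therefore get different result sets, depending only on the authorizations they hold.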
Related
Apache Zeppelin
Apache Zeppelin is an open-source web-based notebook for interactive data analytics, supporting multiple programming languages. The latest version 0.11.1 features Java 11, JDBC connections, and collaborative tools.
Apache Cassandra 5.0 Is Generally Available
Apache Cassandra 5.0 has been released, featuring improved usability, Storage Attached Indexes, Trie optimizations, JDK 17 support, a Unified Compaction Strategy, and vector search capabilities, prompting upgrades from version 3.x.
The Essence of Apache Kafka
Apache Kafka is a distributed event-driven architecture that enables efficient real-time data streaming, ensuring fault tolerance and scalability through an append-only log structure and partitioned topics across multiple nodes.
A FLOSS platform for data analysis pipelines that you probably haven't heard of
Arvados is an open-source platform for managing large datasets, featuring Keep for storage, Crunch for workflow orchestration, and ensuring data security. Users can access it via web, command line, or API.
Show HN: Apache ResilientDB, High-Performance Open-Source Blockchain
Apache ResilientDB is an incubating distributed ledger project by The Apache Software Foundation, focusing on high throughput, scalability, and integrating privacy and transparency in blockchain technology with modern features.