Questions tagged [accumulo]

The Apache Accumulo sorted, distributed key/value store is based on Google's BigTable design. It is built on top of Apache Hadoop, ZooKeeper, and Thrift. It features a few novel improvements on the BigTable design in the form of cell-level access labels and a server-side programming mechanism that can modify key/value pairs at various points in the data management process.

The Apache Accumulo sorted, distributed key/value store is a robust, scalable, high performance data storage and retrieval system. Apache Accumulo is based on Google's BigTable design and is built on top of Apache Hadoop, ZooKeeper, and Thrift. Apache Accumulo features a few novel improvements on the BigTable design in the form of cell-based access control and a server-side programming mechanism that can modify key/value pairs at various points in the data management process. Other notable improvements and feature are outlined here.

Google published the design of BigTable in 2006. Several other open source projects have implemented aspects of this design including HBase, Hypertable, and Cassandra. Accumulo began its development in 2008 and joined the Apache community in 2011.

Official Website: https://accumulo.apache.org/

268 questions
10
votes
4 answers

NoSQL for time series/logged instrument reading data that is also versioned

My Data It's primarily monitoring data, passed in the form of Timestamp: Value, for each monitored value, on each monitored appliance. It's regularly collected over many appliances and many monitored values. Additionally, it has the quirky feature…
jsmith54
  • 131
  • 1
  • 4
9
votes
2 answers

problems while configuring accumulo

In my system Hadoop and zookeeper working properly. Now I have just configured Accumulo.But when i am going to initialize Accumulo by accumulo init it is showing following errors. [root@hydDev32 bin]# ./accumulo init Uncaught exception: Provider…
Rajesh Barri
  • 502
  • 5
  • 21
7
votes
1 answer

Accumulo high speed ingest options

In a nutshell, I have a customer who wants the data contained in a bunch of ASCII text files (a.k.a "input files") ingested into Accumulo. These files are output from diverse data feed devices and will be generated continuously on…
jhop
  • 71
  • 3
6
votes
1 answer

Hadoop/YARN job FAILED - "exited with exitCode: -1000 due to: Could not find any valid local directory for nmPrivate..."

I am trying to run a MapReduce job with Hadoop, YARN and Accumulo. I am getting the following output that I cannot track down the issue. Looks to be a YARN issue, but I am not sure what it is looking for. I have a nmPrivate folder at location…
bdparrish
  • 3,216
  • 3
  • 37
  • 58
5
votes
2 answers

How do I create a Spark RDD from Accumulo 1.6 in spark-notebook?

I have a Vagrant image with Spark Notebook, Spark, Accumulo 1.6, and Hadoop all running. From notebook, I can manually create a Scanner and pull test data from a table I created using one of the Accumulo examples: val instanceNameS = "accumulo" val…
snerd
  • 1,238
  • 1
  • 14
  • 28
5
votes
1 answer

Getting error while running storm connection with accumulo

I have Storm Bolt like follows, package storm.bolt; import java.util.Map; import org.apache.accumulo.core.client.AccumuloException; import org.apache.accumulo.core.client.AccumuloSecurityException; import…
Sreejith
  • 587
  • 1
  • 9
  • 18
5
votes
2 answers

When it comes to mapreduce how are the Accumulo tablets mapped to an HDFS block

If my environment set up is as follows: -64MB HDFS block -5 tablet servers -10 tablets of size 1GB each per tablet server If I have a table like below: rowA | f1 | q1 | v1 rowA | f1 | q2 | v2 rowB | f1 | q1 | v3 rowC | f1 | q1 | v4 rowC | f2 | q1…
chapstick
  • 713
  • 6
  • 16
  • 25
5
votes
2 answers

How to remove instance of accumulo?

I have created a instance while initializing accumulo by calling accumulo init But now i want to remove that instance and as well i want to create a new instance. Can any one help to do this?
Rajesh Barri
  • 502
  • 5
  • 21
4
votes
1 answer

Connecting to Accumulo inside a Mapper using Kerberos

I am moving some software from an older Hadoop Cluster (uses username/password authentication) to a newer one, 2.6.0-cdh5.12.0 which has Kerberos authentication enabled. I have been able to get many of existing Map/Reduce jobs that use Accumulo for…
4
votes
2 answers

Accumulo init - [start.Main] ERROR: initializing the class loader

I'm new to Accumulo and trying to install v1.7 on a Cloudera VM. I have Java 1.7 and HDP 2.2, and Zookeeper is currently running. I've mainly been trying to follow the INSTALL.md without incident and have configured Accumulo however get the…
jhole89
  • 718
  • 9
  • 28
4
votes
1 answer

How to get count for database query in Accumulo

Every database I've ever seen has a method for retrieving the count of the query prior to actually executing it. But I can't figure how to do this simple task in Accumulo. Just for clarity, I want the Accumulo analog of this Mongo feature. I checked…
Mike S
  • 11,329
  • 6
  • 41
  • 76
4
votes
1 answer

ZooKeeperBindException when starting MiniAccumuloCluster

I'm attempting to start up a MiniAccumuloCluster for testing as described in the Accumulo Docs: TemporaryFolder folder= new TemporaryFolder(); File temp_dir = folder.newFolder("AccumuloTempFolder"); MiniAccumuloCluster accumulo = new…
bkaiser
  • 647
  • 8
  • 22
4
votes
1 answer

How do I set an environment variable in a YARN Spark job?

I'm attempting to access Accumulo 1.6 from an Apache Spark job (written in Java) by using an AccumuloInputFormat with newAPIHadoopRDD. In order to do this, I have to tell the AccumuloInputFormat where to locate ZooKeeper by calling the…
Trebor Rude
  • 1,904
  • 1
  • 21
  • 31
4
votes
2 answers

Accumulo Overview console not reachable outside of VirtualBox VM

I am running Accumulo 1.5 in an Ubuntu 12.04 VirtualBox VM. I have set the accumulo-site.xml instance.zookeeper.host file to the VM's IP address, and I can connect to accumulo and run queries from a remote client machine. From the client machine, I…
J21042
  • 1,214
  • 4
  • 19
  • 33
4
votes
1 answer

How to filter Scan on Accumulo using RegEx

I've used scans over data stored in Accumulo before, and have gotten the whole result set back (whatever Range I specified). The problem is, I would like to filter those on the server-side from Accumulo before the client receives them. I'm hoping…
Jack
  • 1,250
  • 1
  • 14
  • 26
1
2 3
17 18