Questions tagged [accumulo]

The Apache Accumulo sorted, distributed key/value store is based on Google's BigTable design. It is built on top of Apache Hadoop, ZooKeeper, and Thrift. It features a few novel improvements on the BigTable design in the form of cell-level access labels and a server-side programming mechanism that can modify key/value pairs at various points in the data management process.

The Apache Accumulo sorted, distributed key/value store is a robust, scalable, high performance data storage and retrieval system. Apache Accumulo is based on Google's BigTable design and is built on top of Apache Hadoop, ZooKeeper, and Thrift. Apache Accumulo features a few novel improvements on the BigTable design in the form of cell-based access control and a server-side programming mechanism that can modify key/value pairs at various points in the data management process. Other notable improvements and feature are outlined here.

Google published the design of BigTable in 2006. Several other open source projects have implemented aspects of this design including HBase, Hypertable, and Cassandra. Accumulo began its development in 2008 and joined the Apache community in 2011.

Official Website: https://accumulo.apache.org/

268 questions

votes

4 answers

NoSQL for time series/logged instrument reading data that is also versioned

My Data It's primarily monitoring data, passed in the form of Timestamp: Value, for each monitored value, on each monitored appliance. It's regularly collected over many appliances and many monitored values. Additionally, it has the quirky feature…

asked Jun 23 '12 at 02:59

jsmith54

votes

2 answers

problems while configuring accumulo

In my system Hadoop and zookeeper working properly. Now I have just configured Accumulo.But when i am going to initialize Accumulo by accumulo init it is showing following errors. [root@hydDev32 bin]# ./accumulo init Uncaught exception: Provider…

accumulo

asked Jul 06 '12 at 10:52

Rajesh Barri

votes

1 answer

Accumulo high speed ingest options

In a nutshell, I have a customer who wants the data contained in a bunch of ASCII text files (a.k.a "input files") ingested into Accumulo. These files are output from diverse data feed devices and will be generated continuously on…

performance hadoop accumulo

asked Feb 11 '14 at 22:52

jhop

votes

1 answer

Hadoop/YARN job FAILED - "exited with exitCode: -1000 due to: Could not find any valid local directory for nmPrivate..."

I am trying to run a MapReduce job with Hadoop, YARN and Accumulo. I am getting the following output that I cannot track down the issue. Looks to be a YARN issue, but I am not sure what it is looking for. I have a nmPrivate folder at location…

hadoop hadoop-yarn accumulo

asked Mar 31 '14 at 14:12

bdparrish

3,216
3
37
58

votes

2 answers

How do I create a Spark RDD from Accumulo 1.6 in spark-notebook?

I have a Vagrant image with Spark Notebook, Spark, Accumulo 1.6, and Hadoop all running. From notebook, I can manually create a Scanner and pull test data from a table I created using one of the Accumulo examples: val instanceNameS = "accumulo" val…

scala hadoop apache-spark accumulo spark-notebook

asked Mar 24 '15 at 23:03

snerd

1,238
1
14
28

votes

1 answer

Getting error while running storm connection with accumulo

I have Storm Bolt like follows, package storm.bolt; import java.util.Map; import org.apache.accumulo.core.client.AccumuloException; import org.apache.accumulo.core.client.AccumuloSecurityException; import…

apache-storm accumulo

asked Jul 07 '14 at 04:50

Sreejith

votes

2 answers

When it comes to mapreduce how are the Accumulo tablets mapped to an HDFS block

If my environment set up is as follows: -64MB HDFS block -5 tablet servers -10 tablets of size 1GB each per tablet server If I have a table like below: rowA | f1 | q1 | v1 rowA | f1 | q2 | v2 rowB | f1 | q1 | v3 rowC | f1 | q1 | v4 rowC | f2 | q1…

hadoop mapreduce accumulo

asked Dec 04 '12 at 22:08

chapstick

votes

2 answers

How to remove instance of accumulo?

I have created a instance while initializing accumulo by calling accumulo init But now i want to remove that instance and as well i want to create a new instance. Can any one help to do this?

accumulo

asked Aug 06 '12 at 08:19

Rajesh Barri

votes

1 answer

Connecting to Accumulo inside a Mapper using Kerberos

I am moving some software from an older Hadoop Cluster (uses username/password authentication) to a newer one, 2.6.0-cdh5.12.0 which has Kerberos authentication enabled. I have been able to get many of existing Map/Reduce jobs that use Accumulo for…

hadoop kerberos accumulo

asked Oct 18 '17 at 14:22

Glen Warholic

votes

2 answers

Accumulo init - [start.Main] ERROR: initializing the class loader

I'm new to Accumulo and trying to install v1.7 on a Cloudera VM. I have Java 1.7 and HDP 2.2, and Zookeeper is currently running. I've mainly been trying to follow the INSTALL.md without incident and have configured Accumulo however get the…

cloudera accumulo

asked Feb 23 '16 at 17:50

jhole89

votes

1 answer

How to get count for database query in Accumulo

Every database I've ever seen has a method for retrieving the count of the query prior to actually executing it. But I can't figure how to do this simple task in Accumulo. Just for clarity, I want the Accumulo analog of this Mongo feature. I checked…

java accumulo

asked Jan 06 '16 at 19:07

Mike S

11,329
6
41
76

votes

1 answer

ZooKeeperBindException when starting MiniAccumuloCluster

I'm attempting to start up a MiniAccumuloCluster for testing as described in the Accumulo Docs: TemporaryFolder folder= new TemporaryFolder(); File temp_dir = folder.newFolder("AccumuloTempFolder"); MiniAccumuloCluster accumulo = new…

java apache-zookeeper accumulo

asked Jul 13 '15 at 16:11

bkaiser

votes

1 answer

How do I set an environment variable in a YARN Spark job?

I'm attempting to access Accumulo 1.6 from an Apache Spark job (written in Java) by using an AccumuloInputFormat with newAPIHadoopRDD. In order to do this, I have to tell the AccumuloInputFormat where to locate ZooKeeper by calling the…

java apache-spark hadoop-yarn cloudera-cdh accumulo

asked Oct 10 '14 at 19:27

Trebor Rude

1,904
1
21
31

votes

2 answers

Accumulo Overview console not reachable outside of VirtualBox VM

I am running Accumulo 1.5 in an Ubuntu 12.04 VirtualBox VM. I have set the accumulo-site.xml instance.zookeeper.host file to the VM's IP address, and I can connect to accumulo and run queries from a remote client machine. From the client machine, I…

virtualbox accumulo

asked Nov 19 '13 at 18:00

J21042

1,214
4
19
33

votes

1 answer

How to filter Scan on Accumulo using RegEx

I've used scans over data stored in Accumulo before, and have gotten the whole result set back (whatever Range I specified). The problem is, I would like to filter those on the server-side from Accumulo before the client receives them. I'm hoping…

regex accumulo

asked Jan 16 '13 at 18:29

Jack

1,250
1
14
26

2 3

…

17 18 Next