Questions tagged [ycsb]

Yahoo! Cloud Serving Benchmark (YCSB): framework and common set of workloads for evaluating the performance of different "key-value" and "cloud" serving stores.

The goal of the Yahoo! Cloud Serving Benchmark (YCSB) project is to develop a framework and common set of workloads for evaluating the performance of different "key-value" and "cloud" serving stores.

The project comprises two areas:

  • The YCSB Client, an extensible workload generator;
  • The Core workloads, a set of workload scenarios to be executed by the generator;

Although the core workloads provide a well-rounded picture of a system's performance, the Client is extensible so that you can define new and different workloads to examine system aspects, or application scenarios, not adequately covered by the core workload. Similarly, the Client is extensible to support benchmarking different databases. Although we include sample code for benchmarking HBase and Cassandra, it is straightforward to write a new interface layer to benchmark your favorite database.
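The two extension points described above (a pluggable database interface layer and a configurable workload) can be illustrated with a small conceptual sketch. This is not the YCSB API, which is written in Java; the names `KeyValueBinding`, `InMemoryBinding`, and `run_workload`, as well as the `read_proportion` parameter, are hypothetical and chosen only for illustration.

```python
import random
from abc import ABC, abstractmethod


class KeyValueBinding(ABC):
    """Hypothetical interface layer: one subclass per database under test."""

    @abstractmethod
    def insert(self, key, fields): ...

    @abstractmethod
    def read(self, key): ...


class InMemoryBinding(KeyValueBinding):
    """Stand-in 'database' backed by a dict, used only for illustration."""

    def __init__(self):
        self.store = {}

    def insert(self, key, fields):
        self.store[key] = dict(fields)
        return True

    def read(self, key):
        return self.store.get(key)


def run_workload(db, record_count, operation_count, read_proportion, seed=0):
    """Load record_count records, then issue a read/insert mix against db."""
    rng = random.Random(seed)
    # Load phase: populate the store with the initial records.
    for i in range(record_count):
        db.insert(f"user{i}", {"field0": "x"})
    counts = {"read": 0, "insert": 0}
    next_key = record_count
    # Run phase: choose each operation according to read_proportion.
    for _ in range(operation_count):
        if rng.random() < read_proportion:
            db.read(f"user{rng.randrange(next_key)}")
            counts["read"] += 1
        else:
            db.insert(f"user{next_key}", {"field0": "x"})
            next_key += 1
            counts["insert"] += 1
    return counts
```

Under these assumptions, benchmarking another store would mean writing one more `KeyValueBinding` subclass, and a different scenario would mean changing the operation mix passed to `run_workload`.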

A common use of the tool is to benchmark multiple systems and compare them. For example, you can install multiple systems on the same hardware configuration, and run the same workloads against each system. Then you can plot the performance of each system (for example, as latency versus throughput curves) to see when one system does better than another.
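As a rough sketch of the comparison described above, the following helper takes latency versus throughput points for several systems and reports which one has the lowest interpolated latency at a given throughput. The function names and the linear interpolation are assumptions made for illustration; YCSB itself only reports the measurements, and any numbers fed to this code would come from your own runs.

```python
def latency_at(curve, throughput):
    """Linearly interpolate latency on a curve of (throughput, latency) points.

    Returns None when the requested throughput is outside the measured range.
    """
    points = sorted(curve)
    for (t0, l0), (t1, l1) in zip(points, points[1:]):
        if t0 <= throughput <= t1:
            if t1 == t0:
                return min(l0, l1)
            return l0 + (l1 - l0) * (throughput - t0) / (t1 - t0)
    return None


def better_system(curves, throughput):
    """Name of the system with the lowest interpolated latency, or None."""
    candidates = {}
    for name, curve in curves.items():
        latency = latency_at(curve, throughput)
        if latency is not None:
            candidates[name] = latency
    if not candidates:
        return None
    return min(candidates, key=candidates.get)
```

Calling `better_system` at several throughput levels shows where the curves cross, that is, the point at which one system starts to do better than another.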

You can find the source code and binaries for benchmarking your Hadoop environment on GitHub. The tool does not work perfectly with newer versions of Hadoop, so you can clone the repository, suggest improvements, and commit your changes.

122 questions
13
votes
2 answers

YCSB for Cassandra 3.0 Benchmarking

I have a Cassandra Ubuntu virtual cluster and need to benchmark it. I am trying to do it with Yahoo's YCSB (without using Maven, if possible). I use Cassandra 3.0.1, but I can't find a suitable version of YCSB. I don't want to change to an older version of…
filby
  • 378
  • 3
  • 11
8
votes
2 answers

Why does MongoDB perform better with multi-threaded client compared to a single threaded client?

We recently benchmarked Oracle 10g and MongoDB with YCSB (https://github.com/brianfrankcooper/YCSB/wiki). When we tried to increase the number of threads for 1,000,000 datasets, Oracle's performance remained constant after 4 threads however…
vikasing
  • 11,562
  • 3
  • 25
  • 25
6
votes
3 answers

Speeding up HBase read response

I have a 4-node HBase v0.90.4-cdh3u3 cluster deployed on Amazon XLarge instances (16 GB RAM, 4 CPU cores) with an 8 GB heap (-Xmx) allocated for the HRegion servers and 2 GB for the datanodes. HMaster\ZK\Namenode is on a separate XLarge instance. Target dataset is…
S B
  • 61
  • 1
  • 4
5
votes
1 answer

Zipfian vs Uniform - What's the difference between these two YCSB distributions?

Can anyone please describe the differences between the Zipfian and Uniform distributions when running YCSB workloads? Here are the YCSB core properties: https://github.com/brianfrankcooper/YCSB/wiki/Core-Properties
3
votes
2 answers

YCSB JDBC driver: java.lang.ClassNotFoundException

While trying to load data into MySQL, I am getting the following error: ~/YCSB$ bin/ycsb load jdbc -P workloads/workloada -P db.properties [WARN] Running against a source checkout. In order to get our runtime dependencies we'll have to invoke Maven.…
abouyahya85
  • 115
  • 9
3
votes
4 answers

Maven Building Error

When I try to run mvn clean package, I am getting the following error: Downloading: http://repo.maven.apache.org/maven2/org/apache/maven/plugins/maven-clean-plugin/2.4.1/maven-clean-plugin-2.4.1.pom [INFO]…
Rohit
  • 635
  • 6
  • 12
  • 22
3
votes
0 answers

Unable to run YCSB successfully for ElasticSearch

I am new to both YCSB and ElasticSearch. I was able to run YCSB easily for Cassandra. However, I have not been able to do the same with ES (or perhaps I have but I am not sure). Following the steps documented in YCSB/elasticsearch, I was able to…
vaidik
  • 2,191
  • 1
  • 16
  • 22
2
votes
1 answer

YCSB - why can I never find a shard configuration anywhere on the internet?

I see all kinds of references to MongoDB as a client for the YCSB benchmarks to test NoSQL database server scalability / elasticity. https://github.com/brianfrankcooper/YCSB However, it is clear that the benchmark would require some kind of…
Robert
  • 368
  • 2
  • 8
2
votes
0 answers

YCSB full table scan on HBase?

I want to analyze the full table scan performance of HBase, but after reading the source code of YCSB, I find that the scan length is randomly generated. Can YCSB carry out a full table scan of an HBase table to analyze the performance? Thanks~
hac
  • 31
  • 1
  • 6
2
votes
1 answer

Running TPC-C (or YCSB) benchmark on KV-store

I just created a distributed key-value store (written in C++) for a grad research project, and need to test it with some standard DB benchmarks. I would like to run the TPC-C and YCSB benchmarks on it. The key-value store provides ACID…
aspen100
  • 965
  • 11
  • 22
2
votes
1 answer

How does Cassandra handle errors? Will it retry or fail a request when some nodes are down?

I'm running YCSB on a 6-node Cassandra cluster with default settings. Assuming that the client has established a connection with the coordinator and found sufficient replicas to meet its consistency level, what will happen if: (1) the coordinator is down?…
roymaztang
  • 65
  • 5
2
votes
1 answer

Generating high volume and velocity data with Yahoo Cloud Serving Benchmark

YCSB offers different types of workloads. How can I simulate a real-time network environment using the Yahoo Cloud Serving Benchmark, where we can expect the communication rate to decrease or increase based on a predefined distribution size of packets or…
kashif
  • 41
  • 6
2
votes
0 answers

Error when using the YCSB tool for MongoDB

I am trying to test MongoDB using the YCSB tool. I installed Java and Maven and downloaded ycsb-master from https://github.com/brianfrankcooper/YCSB. I ran the mvn clean package command so that the MongoDB binding is built successfully. To load the workload I…
2
votes
0 answers

Why does insert latency increase with an increasing number of shards in MongoDB?

I am using MongoDB 2.6. I have observed that write/insert latency increases when the number of shards increases. I used YCSB to test this behavior, and sharding is hash-based on the primary key _id. Results are like, no. of shards insert latency…
Nachiket Kate
  • 8,473
  • 2
  • 27
  • 45
2
votes
1 answer

How to interpret this YCSB CouchDB benchmark?

I recently ran YCSB benchmarks on CouchDB with 2 different workloads. Both workloads were using a database containing 500.000 documents and both of them executed 100.000 operations. The distribution of operations for each workload was: Workload…
Anton Horst
  • 263
  • 1
  • 4
  • 11