Questions tagged [distributed-database]

Anything related to distributed databases and the techniques and the tools used to manage them. A distributed database is a database whose information is not stored in a single physical location, but it is spread over various devices, often placed far apart.

Anything related to distributed databases and the techniques and the tools used to manage them. A distributed database is a database whose information is not stored in a single physical location, but it is spread over various devices, often placed far apart.

188 questions
194
votes
15 answers

What scalability problems have you encountered using a NoSQL data store?

NoSQL refers to non-relational data stores that break with the history of relational databases and ACID guarantees. Popular open source NoSQL data stores include: Cassandra (tabular, written in Java, used by Cisco, WebEx, Digg, Facebook, IBM,…
knorv
  • 49,059
  • 74
  • 210
  • 294
59
votes
3 answers

What is semi-join in database?

I am having trouble while trying to understand the concept of semi-join and how it is different from conventional join. I have tried some article already but not satisfied with the explanation, could someone please help me to understand it?
Henu
  • 1,622
  • 2
  • 22
  • 27
16
votes
10 answers

Need a distributed key-value lookup system

I need a way to do key-value lookups across (potentially) hundreds of GB of data. Ideally something based on a distributed hashtable, that works nicely with Java. It should be fault-tolerant, and open source. The store should be persistent, but…
sanity
  • 35,347
  • 40
  • 135
  • 226
11
votes
1 answer

Amazon DynamoDB Conditional Writes and Atomic Counters

The application im working on currently requires me to increment an attribute belonging to an item in DynamoDB many times in a 20 to 30 minute period. I've been doing some additional reading about DynamoDBs conditional writes and atomic…
john
  • 709
  • 3
  • 13
  • 25
7
votes
2 answers

How can MySQL Cluster 7.3 achieve 99,999% Availability? Antithesis to CAP Theorem

According to the "Guide to Scaling Web Databases with MySQL Cluster", MySQL Cluster 7.3 can acchieve 99,999% availability while using synchronous update replication. This would be a antithesis to the CAP Theorem since it states that perfect…
NorRen
  • 711
  • 2
  • 9
  • 22
5
votes
4 answers

Presto vs Impala: architecture, performance, functionality

Could you highligh major differences between the two in architecture & functionality in 2019? And how that differences affect performance? For some reason this excellent question was tagged as opinion-based. Extra-question: why Amazon decide to go…
VB_
  • 45,112
  • 42
  • 145
  • 293
5
votes
2 answers

What node does Cassandra store data on?

Is there a command or any way at all to know what data is stored on what nodes of Cassandra? Im pretty new to Cassandra and haven't had much luck googling this question. Thanks!
user3376961
  • 867
  • 2
  • 12
  • 17
4
votes
1 answer

Looking for a mature, scalable GraphDB with .NET or C++ binding

My basic requirements from a GraphDB: Mature (production-ready) Native .NET or C++ language binding Horizontal scalability: both Automated data redundancy and sharding Distributed graph algorithms / query execution Currently I disqualified the…
Lior Kogan
  • 19,919
  • 6
  • 53
  • 85
4
votes
1 answer

Scaling TerminusDB to multiple servers

Can TerminusDB run in a multi-server environment? Either with a Sharded and Clustered database mode Or can it run on Distributed database scheme? Or it can only run as a single process and does not horizontally scale? If it can scale horizontally,…
quarks
  • 33,478
  • 73
  • 290
  • 513
4
votes
1 answer

Aerospike: How Primary & Secondary Index works internally

We are using Aerospike DB and was going through the documentation. I could not find good explanation of algorithm explaining how Primary & Secondary index works. The documentation says it uses some sort of distributed hash + B Tree. Could someone…
Tarun
  • 3,162
  • 3
  • 29
  • 45
4
votes
0 answers

Can I use bigchainDB server with django instead of using sqlite?

I am creating degree verification process using blockchain approach which contain six main entities. By entities I mean to say consensus mechanism will evolve around these six entities, so for this I need to build a distributed database. Two…
4
votes
3 answers

Data allocation in the distributed databases

How to optimize a data allocation in the distributed database? Are there any software products for solving this problem? For example: There are some number of connected servers for the distributed database. Each server simultaneously is a client of…
4
votes
2 answers

Looking for distributed, in-memory Graph DB

Is there a database which will satisfy all (or at least most) of this requirements? Graph oriented - optimized for storing graphs and traversal(e.g HyperGraphDB, Neo4j) Running in memory, but having a persisted storage (e.g. Redis) Distributed…
Insomniac
  • 3,354
  • 2
  • 23
  • 24
4
votes
0 answers

How to shard using OrientDB

How to achieve sharding on OrientDB? Suppose I have three nodes viz. node1, node2 and node3. I have two clusers viz. zip_india, zip_usa. Now I tried to set the servers up such that zip_india will be on node1 and node2 and zip_usa will be on node3.…
Abhilash Panigrahi
  • 1,455
  • 1
  • 13
  • 31
4
votes
2 answers

steps for making a non-distributed db to distributed db [talking about lmdb specifically]

I am dealing with lmdb and lmdb is really fast . But one problem with it is lmdb isn't distributed . Everytime i need to do some maintenance , lmdb has to taken down and sites is unavailable for few hours . I researching on making lmdb a…
igauravsehrawat
  • 3,696
  • 3
  • 33
  • 46
1
2 3
12 13