Questions tagged [druid]

Druid is a column-oriented open-source distributed data store written in Java.

According to the Apache Druid website:

Apache Druid is a real-time analytics database designed for fast slice-and-dice analytics ("OLAP" queries) on large data sets. Most often, Druid powers use cases where real-time ingestion, fast query performance, and high uptime are important.

Druid is commonly used as the database backend for GUIs of analytical applications, or for highly-concurrent APIs that need fast aggregations. Druid works best with event-oriented data.

597 questions
11
votes
2 answers

Can Druid replace Hadoop?

Druid is used for both real-time and batch processing. But can it totally replace Hadoop? If not, why? That is, what is the advantage of Hadoop over Druid? I have read that Druid is used along with Hadoop. So can the use of Hadoop be avoided?
Amit Sharma
  • 133
  • 1
  • 7
10
votes
2 answers

Add a druid cluster as a SQL database in Apache Superset

I currently connect to the Druid cluster through the Druid connector in Apache Superset. I heard that SQL can be used to query Druid. Is it possible to point my SQL database connection to Druid?
Climbs_lika_Spyder
  • 6,004
  • 3
  • 39
  • 53
10
votes
3 answers

Can Druid replace Cassandra?

I can't help thinking that there aren't many use cases that Cassandra can serve better than Druid. As a time-series or key-value store, queries can be written in Druid to extract data however needed. The argument here is more around…
TechJack
  • 301
  • 2
  • 7
9
votes
1 answer

How to add a new column to an existing Druid schema?

I created a schema and added 1 TB of data to it. Then the log file version was upgraded and two new columns were added. Now I want to add that data to the Druid schema, but haven't been able to yet.
8
votes
4 answers

Is there a possibility to have another timestamp as dimension in Druid?

Is it possible to have a Druid datasource with 2 (or more) timestamps in it? I know that Druid is a time-based DB and I have no problem with the concept, but I'd like to add another dimension that I can work with as a timestamp, e.g. User…
Štefan Šoóš
  • 159
  • 1
  • 8
8
votes
2 answers

Importing data from MySQL to Druid

I have been using MySQL for all my data storage and querying. But now that the tables have grown so large, it sometimes takes hours to get results. I have taken every possible measure, like optimizing queries and indexing the tables…
Joy
  • 4,197
  • 14
  • 61
  • 131
7
votes
1 answer

Does Apache Superset support Weighted Averages?

I'm trying to use Apache Superset to create a dashboard that will display the average rate of X/Y at different entities such that the time grain can be changed on the fly. However, all I have available as raw data is daily totals of X and Y for the…
JZ1987
  • 99
  • 4
7
votes
1 answer

Apache Druid SQL query conversion to JSON-based query

I am trying to convert the following Druid SQL query to a Druid JSON query, as one of my columns is a multi-value dimension, for which Druid does not support a SQL-style query. My SQL query: SELECT date_dt, source, type_labels, COUNT(DISTINCT…
Pratik Khadloya
  • 12,509
  • 11
  • 81
  • 106
6
votes
1 answer

GCP Dataproc has Druid available in alpha. How to load segments?

The dataproc page describing druid support has no section on how to load data into the cluster. I've been trying to do this using GC Storage, but don't know how to set up a spec for it that works. I'd expect the "firehose" section to have some…
radialmind
  • 279
  • 2
  • 15
6
votes
1 answer

Intersect two queries with different filters

I use Druid for monitoring events in my website. The data can be represented as follows:
event_id | country | user_id | event_type
================================================
1 | USA | id1 | visit
2 | …
orenMos
  • 71
  • 1
  • 6
6
votes
3 answers

druid vs Elasticsearch

I'm new to Druid. I've already read "druid VS Elasticsearch", but I still don't know what Druid is good at. Below is my problem: I have a Solr cluster with 70 nodes. I have a very big table in Solr which has 1 billion rows, and each row has 100…
zhouxiang
  • 153
  • 3
  • 12
5
votes
2 answers

org.apache.druid.java.util.common.ISE: No default server found

I'm setting up Druid for the first time and ran into the following issue while trying to start Druid using docker-compose: 2020-04-10T14:40:01,837 ERROR [qtp1667348377-84] org.apache.druid.server.router.QueryHostFinder - Catastrophic failure! No servers found…
Mnd
  • 51
  • 1
  • 2
5
votes
2 answers

Can't create Druid ingestion task through API

When I send a JSON ingestion specification to the Druid Overlord API, I get this response:
HTTP/1.1 400 Bad Request
Content-Type: application/json
Date: Wed, 25 Sep 2019 11:44:18 GMT
Server: Jetty(9.4.10.v20180503)
Transfer-Encoding: chunked
{ …
Artem Vovsia
  • 1,520
  • 9
  • 15
5
votes
1 answer

Apache druid No known server

I am trying to set up Apache Druid on a single machine following the quickstart guide here. When I start the historical server, it shows an io.druid.java.util.common.IOE: No known server exception on screen. Command: java `cat…
Rahul Sharma
  • 5,614
  • 10
  • 57
  • 91
5
votes
2 answers

List of supported data types for dimensions in Druid?

I cannot seem to find any tutorial or doc page on the Druid website that lists all supported data types for dimensions in Druid. From what I've read, I know that long, float and string are definitely supported, but I have…
Tarun Verma
  • 329
  • 5
  • 17