Infobright is a column-oriented database built on MySQL that uses an in-memory Knowledge Grid. It is best used as an analytic database for serving large amounts of machine-generated data, and it is not designed as a transactional database. It works best with data volumes between hundreds of gigabytes and fifty terabytes, and with a denormalized schema. The product comes in two forms: a GPLv2 open-source edition and an enterprise edition.
Questions tagged [infobright]
41 questions
19
votes
4 answers
Time and date dimension in data warehouse
I'm building a data warehouse. Each fact has its timestamp. I need to create reports by day, month, and quarter, but by hour too. Looking at the examples I see that dates tend to be saved in dimension tables.
(source: etl-tools.info)
But I think,…

Piotr Gwiazda
- 12,080
- 13
- 60
- 91
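As a rough illustration of the date-dimension idea raised in the question above, the following is a minimal sketch, not a recommended design. It assumes a separate date key and hour-of-day key derived from each fact's timestamp; the function name `dimension_keys` and the field names `date_key` and `hour_key` are hypothetical and do not come from the question.

```python
from datetime import datetime


def dimension_keys(ts: datetime) -> dict:
    """Derive hypothetical date and hour dimension attributes from a fact timestamp."""
    return {
        # Integer key in YYYYMMDD form (an assumed convention for a date dimension).
        "date_key": ts.year * 10000 + ts.month * 100 + ts.day,
        # Hour of day, 0..23, usable as a key into a small hour dimension.
        "hour_key": ts.hour,
        "year": ts.year,
        # Calendar quarter: months 1-3 map to 1, 4-6 to 2, and so on.
        "quarter": (ts.month - 1) // 3 + 1,
        "month": ts.month,
    }


if __name__ == "__main__":
    print(dimension_keys(datetime(2011, 3, 15, 14, 30)))
```

Keeping the hour in its own small dimension would let daily, monthly, and quarterly reports group on the date attributes while hourly reports group on the hour key, but whether that split suits a given warehouse depends on the reporting needs.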
7
votes
2 answers
Single transaction using multiple connections. (MySQL/JDBC)
The application I'm working on is a Java-based ETL process that loads data into multiple tables. The DBMS is Infobright (a MySQL-based DBMS geared for data warehousing).
The data loading should be done atomically; however, for performance reasons, I…

Filip
- 1,451
- 1
- 11
- 19
5
votes
2 answers
Data storage for financial analysis
I am building a system to analyze large quantities of financial data regarding securities trading prices. A large challenge in this is determining what storage method to use for the data, given that the data will be in the tens of terabytes. There…

user396404
- 2,759
- 7
- 31
- 42
5
votes
9 answers
Why do column-oriented databases such as Vertica/InfoBright/GreenPlum make a fuss about Hadoop?
What is the point in feeding a Hadoop cluster and using that cluster to feed data into a Vertica/InfoBright data warehouse?
All these vendors keep saying "we can connect with Hadoop", but I don't understand what the point is. What is the interest of…

SCO
- 1,832
- 1
- 24
- 45
5
votes
3 answers
Free data warehouse - Infobright, Hadoop/Hive or what?
I need to store a large number of small data objects (millions of rows per month). Once they're saved they won't change. I need to:
store them securely
use them for analysis (mostly time-oriented)
retrieve some raw data occasionally
It would be nice…

Piotr Gwiazda
- 12,080
- 13
- 60
- 91
5
votes
6 answers
How to tackle a BIG DATA Data Mart / Fact Table? (240 million rows)
We have a BI customer that generates about 40 million rows every month in their sales database tables, generated from their sales transactions. They want to build a Sales Data Mart with five years of their historical data, meaning that this fact…

Eduardo Williams
- 51
- 1
- 5
4
votes
9 answers
database for analytics
I'm setting up a large database that will generate statistical reports from incoming data.
The system will for the most part operate as follows:
Approximately 400k-500k rows - about 30 columns, mostly varchar(5-30) and datetime - will be uploaded…

rfusca
- 7,435
- 2
- 30
- 34
3
votes
1 answer
Mixing column and row oriented databases?
I am currently trying to improve the performance of a web application. The goal of the application is to provide (real-time) analytics. We have a database model that is similar to a star schema, with a few fact tables and many dimension tables. The…

enricog
- 4,226
- 5
- 35
- 54
2
votes
2 answers
Max number of columns in Infobright
We store billions of rows in an Infobright table which currently has about 45 columns. We want to add 50 more columns to it. Will adding these columns bring down the performance of reads? Is creating a new table for these columns a better option?…

hese
- 3,397
- 8
- 25
- 34
2
votes
2 answers
PDI (Kettle) looped job step parallelization
Preface
I have automated and scripted the creation of individual .ktr files to handle the extraction and syncing of data between Source (MySQL) and Target (InfoBright) databases. One .ktr file is created for each table.
I have a set of 2 Jobs and 2…

Noah
- 1,857
- 15
- 19
2
votes
1 answer
Trying to figure out SQL query for monthly user churn based on an activity threshold
I have a table (we're on InfoBright columnar storage and I use MySQL Workbench as my interface) that essentially tracks users and a count of activities with a datestamp. It's a daily aggregate table. Schema is essentially
userid…

no_longer_a_techie
- 21
- 2
2
votes
3 answers
MySQL Select Into Outfile Without Quotes
Is it possible, and if so how, to SELECT ... INTO OUTFILE without enclosing the fields in any character?
So far this doesn't work:
SELECT hour_stamp,
day_stamp,
month_stamp,
hour,
day,
month,
…

Russ Bradberry
- 10,705
- 17
- 69
- 85
1
vote
5 answers
How to connect to an Infobright DB through Perl?
How can I connect to an Infobright DB through Perl?

Tree
- 9,532
- 24
- 64
- 83
1
vote
0 answers
InfoBright MySQL daily crashes for the last 4 days
For the last 4 days we have been facing frequent database crashes with the MySQL Infobright engine. There have been no recent changes to the production environment and no updates.
Currently we are using version 5.1.40.
Please find the dump below. Can anyone help to…

RAVITEJA DUVVA
- 11
- 1
1
vote
1 answer
Accessing Infobright from Java
Could someone recommend a technology for accessing Infobright from Java?
As far as I understand, I should use a plain JDBC connection and execute queries, rather than a high-level tool like Hibernate. Am I right?

Andrew
- 2,663
- 6
- 28
- 50