Questions tagged [data-generation]
263 questions
104
votes
16 answers
Tools for Generating Mock Data?
I'm looking for recommendations of a good, free tool for generating sample data for the purpose of loading into test databases. By analogy, something that produces "lorem ipsum" text for any RDBMS. Features I'm looking for include:
Flexibility to…

Bill Karwin
- 538,548
- 86
- 673
- 828
48
votes
4 answers
Does a fake data generator exist in Java?
I would like to know if a fake data generator exists for Java. Perl has Data::Faker, there is a Ruby port called faker, and JavaScript has faker.js. Does anyone know of a fake data generator for Java that can provide random names, phone…

Rubens Mariuzzo
- 28,358
- 27
- 121
- 148
44
votes
9 answers
Data generators for SQL Server?
I would like to receive suggestions on the data generators that are available for SQL Server. If posting a response, please describe any features that you think are important.
I have never used an application like this, so I am looking to be…

Phobis
- 7,524
- 10
- 47
- 76
28
votes
5 answers
How do I generate text matching a regular expression from a regular expression?
Yup, you read that right. I need something that is capable of generating random text from a regular expression. The text should be random, but it must be matched by the regular expression. It seems that no such tool exists, but I could be wrong.
Just a an…

Wilfred Springer
- 10,869
- 4
- 55
- 69
26
votes
1 answer
Understanding `width_shift_range` and `height_shift_range` arguments in Keras's ImageDataGenerator class
The Keras documentation of the ImageDataGenerator class says:
width_shift_range: Float, 1-D array-like or int
- float: fraction of total width, if < 1, or pixels if >= 1.
- 1-D array-like: random elements from the array.
- int: integer number of pixels…

Arafat Hasan
- 2,811
- 3
- 21
- 38
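The float rule quoted in the excerpt above can be read as a small function. This is only a sketch of that one sentence of documentation, not Keras code: the helper name `max_width_shift_pixels` and its `image_width` parameter are assumptions made for illustration, and the array-like and int cases are left out because the excerpt is truncated.

```python
def max_width_shift_pixels(width_shift_range, image_width):
    """Largest horizontal shift, in pixels, implied by a float setting.

    Hypothetical helper (not part of Keras) that restates the quoted rule:
    "fraction of total width, if < 1, or pixels if >= 1".
    """
    if width_shift_range < 1:
        # Values below 1 are read as a fraction of the total image width.
        return width_shift_range * image_width
    # Values of 1 or more are read directly as a pixel count.
    return float(width_shift_range)


if __name__ == "__main__":
    print(max_width_shift_pixels(0.25, 200))
    print(max_width_shift_pixels(30.0, 200))
```

Under this reading, the same float argument changes meaning at the value 1, which is a likely source of the confusion the question title refers to.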
11
votes
3 answers
How to know the number of images in flow_from_directory
We can generate an image dataset using ImageDataGenerator with the flow_from_directory method.
train_datagen = ImageDataGenerator(
rescale=1./255, #scale images from integers 0-255 to floats 0-1.
shear_range=0.2,
zoom_range=0.2, # zoom in or…

Eliza
- 584
- 4
- 14
10
votes
5 answers
How to generate Bad Random Numbers
I'm sure the opposite has been asked many times but I couldn't find any answers on how to generate bad random numbers.
I want to write a small program for cluster analysis and want to generate some random points for testing. If I would just insert…

Nicolas
- 1,828
- 6
- 23
- 34
9
votes
6 answers
Java 8 Stream IllegalStateException: Stream has already been operated on or closed
I'm trying to generate Order instances using the Stream API. I have a factory function that creates the order, and a DoubleStream is used to initialize the amount of the order.
private DoubleStream doubleStream = new Random().doubles(50.0,…

Ole
- 41,793
- 59
- 191
- 359
8
votes
3 answers
Generate data for MySQL tables
Is there a program like Red Gate's Data Generator for MS SQL Server that generates data based on your database schema for MySQL?
Other (easy) alternatives for accomplishing this are very welcome!

super9
- 29,181
- 39
- 119
- 172
8
votes
3 answers
Is there a lib to generate data according to a regexp? (Python or other)
Given a regexp, I would like to generate random data x times to test something.
e.g.
>>> print generate_date('\d{2,3}')
13
>>> print generate_date('\d{2,3}')
422
Of course the objective is to do something a bit more complicated than that…

Bite code
- 578,959
- 113
- 301
- 329
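For the regexp question above, one possible approach is to parse the pattern and emit random characters for each piece. The sketch below is not taken from any answer and is not a general solution: the name `generate_from_pattern` is hypothetical, and it handles only a deliberately small subset (word characters, spaces and hyphens as literals, `\d`, and `{m}` or `{m,n}` quantifiers), raising an error for anything else.

```python
import random
import re

# One token = an atom (\d or a plain literal) plus an optional {m} / {m,n} quantifier.
# Only this small subset of regex syntax is supported (assumption of this sketch).
TOKEN = re.compile(r"(\\d|[\w \-])(?:\{(\d+)(?:,(\d+))?\})?")


def generate_from_pattern(pattern, rng=random):
    """Return a random string matched by `pattern` (limited subset only)."""
    out = []
    pos = 0
    while pos < len(pattern):
        m = TOKEN.match(pattern, pos)
        if m is None:
            raise ValueError(f"unsupported syntax at position {pos}")
        atom, low, high = m.groups()
        low = int(low) if low is not None else 1      # no quantifier: exactly once
        high = int(high) if high is not None else low  # {m}: exactly m times
        for _ in range(rng.randint(low, high)):
            out.append(rng.choice("0123456789") if atom == r"\d" else atom)
        pos = m.end()
    return "".join(out)


if __name__ == "__main__":
    sample = generate_from_pattern(r"\d{2,3}")
    # The generated text should always be accepted by the original pattern.
    assert re.fullmatch(r"\d{2,3}", sample)
    print(sample)
```

Checking each result with `re.fullmatch` against the original pattern is a cheap way to confirm that the generator and the regex engine agree.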
7
votes
3 answers
How to generate bivariate data of different shapes (e.g., square, circle, rectangle) with outliers?
I am currently looking for a tool that generates datasets of different shapes (square, circle, rectangle, etc.) with outliers for cluster analysis.
Can anyone recommend a good dataset generator for cluster analysis?
Is there…

Pradeep
- 555
- 8
- 14
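When no ready-made tool is at hand, shaped test data of the kind asked for above can be produced with the standard library alone. The following is a minimal sketch, assuming uniform sampling inside each shape and outliers drawn uniformly from a larger box. All function names and defaults (`sample_square`, `sample_disc`, `add_outliers`, `spread`) are illustrative assumptions, not part of any existing package.

```python
import math
import random


def sample_square(n, side=1.0, rng=random):
    """n points uniformly distributed in a square centred on the origin."""
    half = side / 2
    return [(rng.uniform(-half, half), rng.uniform(-half, half)) for _ in range(n)]


def sample_disc(n, radius=1.0, rng=random):
    """n points uniformly distributed over a filled circle centred on the origin."""
    points = []
    for _ in range(n):
        # The square root keeps the density uniform over the area;
        # without it, points would bunch up near the centre.
        r = radius * math.sqrt(rng.random())
        theta = rng.uniform(0, 2 * math.pi)
        points.append((r * math.cos(theta), r * math.sin(theta)))
    return points


def add_outliers(points, n_outliers, spread=5.0, rng=random):
    """Append points drawn from a larger box; a few may still fall inside the shape."""
    outliers = [
        (rng.uniform(-spread, spread), rng.uniform(-spread, spread))
        for _ in range(n_outliers)
    ]
    return points + outliers


if __name__ == "__main__":
    data = add_outliers(sample_disc(200), n_outliers=10)
    print(len(data), data[:3])
```

A rectangle follows the same pattern as the square with separate width and height, and passing a seeded `random.Random` instance as `rng` makes the dataset reproducible.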
7
votes
1 answer
What alternatives exist for Data Generation Plans for Visual Studio Express users?
I'm wondering if there is a good option for those of us using Express (or even Professional) editions without having to go outside the Visual Studio project (i.e. writing a ruby/perl/etc script to populate test/seed data).
If it helps I'm working…

Jeff Swensen
- 3,513
- 28
- 52
6
votes
2 answers
Generating test data - how to generate a valid address for a given US zipcode?
I am creating a tool which depends on addresses. For the purposes of testing, I'd like to create a large number of valid US addresses. I have the GeoNames postal code data and I would like to generate some number of real addresses for each of the…

Joseph
- 12,678
- 19
- 76
- 115
6
votes
2 answers
How do you statically generate float data at compile time?
Given that I want to perform filtering on some data, how can I avoid generating this data at runtime while keeping the flexibility to change the sizes and data distribution of these filters, and still keep the code clean and reusable? I know that I can…

geoff3jones
- 605
- 1
- 7
- 17
6
votes
2 answers
Creating a Data Generation Plan that Includes Correct Lookup Table Values
I'm just getting started with Visual Studio Database projects and loving the data generation plans I can create with it. However, in one of my projects I need to populate a lookup table with specific values. Since a foreign-key relationship exists…

Kevin Babcock
- 10,187
- 19
- 69
- 89