Questions tagged [data-visualization]

Data visualization is the study of the visual representation of data. Maps and charts are the most common types of data visualization, but there are many more.

Data visualization (or data visualisation) is the creation and study of the visual representation of data, and is closely related to information graphics, information visualization, scientific visualization, and statistical graphics.

In recent years, data visualization has become an active area of research, teaching and development.

There are different approaches on the scope of data visualization. One common focus is on information presentation. From a computer science perspective data visualization is comprised of:

  • Visualization algorithms and techniques
  • Volume visualization
  • Information visualization
  • Multiresolution methods
  • Modelling techniques
  • Interaction techniques and architectures
  • Infographics

There are several open source libraries that can be used to produce data visualizations including and for Python and for JavaScript.


Useful links


Related tags

7470 questions
944
votes
11 answers

How to make IPython notebook matplotlib plot inline

I am trying to use IPython notebook on MacOS X with Python 2.7.2 and IPython 1.1.0. I cannot get matplotlib graphics to show up inline. import matplotlib import numpy as np import matplotlib.pyplot as plt %matplotlib inline I have also tried…
Ian Fiske
  • 10,482
  • 3
  • 21
  • 20
358
votes
11 answers

How do I convert a numpy array to (and display) an image?

I have created an array thusly: import numpy as np data = np.zeros( (512,512,3), dtype=np.uint8) data[256,256] = [255,0,0] What I want this to do is display a single red dot in the center of a 512x512 image. (At least to begin with... I think I can…
jlswint
  • 3,683
  • 2
  • 16
  • 4
349
votes
19 answers

Plot correlation matrix using pandas

I have a data set with huge number of features, so analysing the correlation matrix has become very difficult. I want to plot a correlation matrix which we get using dataframe.corr() function from pandas library. Is there any built-in function…
203
votes
12 answers

How can I run Tensorboard on a remote server?

I'm new to Tensorflow and would greatly benefit from some visualizations of what I'm doing. I understand that Tensorboard is a useful visualization tool, but how do I run it on my remote Ubuntu machine?
189
votes
14 answers

Workflow for statistical analysis and report writing

Does anyone have any wisdom on workflows for data analysis related to custom report writing? The use-case is basically this: Client commissions a report that uses data analysis, e.g. a population estimate and related maps for a water district. The…
forkandwait
  • 5,041
  • 7
  • 23
  • 22
168
votes
5 answers

How to plot a histogram using Matplotlib in Python with a list of data?

How do I plot a histogram using matplotlib.pyplot.hist? I have a list of y-values that correspond to bar height, and a list of x-value strings. Related: matplotlib.pyplot.bar.
DataVizGuys
  • 1,689
  • 2
  • 10
  • 3
159
votes
4 answers

Moving x-axis to the top of a plot in matplotlib

Based on this question about heatmaps in matplotlib, I wanted to move the x-axis titles to the top of the plot. import matplotlib.pyplot as plt import numpy as np column_labels = list('ABCD') row_labels = list('WXYZ') data = np.random.rand(4,4) fig,…
Jason Sundram
  • 12,225
  • 19
  • 71
  • 86
117
votes
1 answer

ggplot2 plot area margins?

Is there an easy way to increase the space between the plot title and plot area below it (i.e., the box with the data). Similarly, I'd prefer to have some space between the axis title and axis labels. In other words, is there a way to "move the…
KT.
  • 10,815
  • 4
  • 47
  • 71
105
votes
3 answers

A good book for learning D3.js

I saw D3.js and I was interested so I bought this book! I may be wrong, however I found it un-fulfilling. Does anyone have prime material for mastering D3.jsfor data visualization? I can see a lot of potential for D3.jsand I am very keen and…
laycat
  • 5,381
  • 7
  • 31
  • 46
105
votes
4 answers

Heatmap in matplotlib with pcolor?

I'd like to make a heatmap like this (shown on FlowingData): The source data is here, but random data and labels would be fine to use, i.e. import numpy column_labels = list('ABCD') row_labels = list('WXYZ') data = numpy.random.rand(4,4) Making…
Jason Sundram
  • 12,225
  • 19
  • 71
  • 86
91
votes
7 answers

GUI for statsd data other than Graphite?

I just installed graphite/statsd for production use. I'm really happy with it, but one of my co-workers asked me if there was a way to make it look prettier. Honestly, I can't say that I haven't wonder the same. Are there alternatives to the…
Blake Taylor
  • 9,217
  • 5
  • 38
  • 41
90
votes
2 answers

Construct a manual legend for a complicated plot

I cannot figure out how to manually set up a legend for this plot. All I really want is a simple legend to the right that uses the three colors and has a name next to each. The current code looks like this: a…
Torvon
  • 1,113
  • 1
  • 10
  • 12
78
votes
2 answers

ggplot2, change title size

I would like to have my main title and axis title have the same font size as the annotated text in my plot. i used theme_get() and found that text size is 12, so I did that in my theme statement - this did not work. I also tried to send the relative…
user1375871
  • 1,199
  • 1
  • 12
  • 23
77
votes
6 answers

Tree plotting in Python

I want to plot trees using Python. Decision trees, Organizational charts, etc. Any library that helps me with that?
Injeniero Barsa
  • 1,085
  • 2
  • 10
  • 12
77
votes
6 answers

Use index in pandas to plot data

I have a pandas-Dataframe and use resample() to calculate means (e.g. daily or monthly means). Here is a small example. import pandas as pd import numpy as np dates = pd.date_range('1/1/2000', periods=100) df = pd.DataFrame(np.random.randn(100,…
paulchen
  • 1,009
  • 1
  • 10
  • 17
1
2 3
99 100