Is there any way to import an SPSS dataset into Python, preferably in NumPy recarray format? I have looked around but could not find an answer.
Joon
Option 1: As rkbarney pointed out, the Python package savReaderWriter is available via PyPI. I've run into two issues:
Option 2: I've chosen to use R as a middleman. Using rpy2, I set up a simple function that reads the file into an R data frame and writes it out again as a CSV file, which I then import into Python. It's a bit Rube Goldberg, but it works. Of course, this requires R, which may also be a hassle to install in your environment (and has different binaries for different platforms).
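A minimal sketch of that round trip, not the exact function described above. It assumes R's `foreign` package is installed (for `read.spss`) and that rpy2 and NumPy are available. The helper names `build_r_code` and `sav_to_recarray` are made up for illustration.

```python
def build_r_code(sav_path, csv_path):
    """Return R code that reads an SPSS .sav file and writes it out as CSV."""
    # R accepts forward slashes on every platform; backslashes would need escaping.
    sav = sav_path.replace("\\", "/")
    csv = csv_path.replace("\\", "/")
    # quote = FALSE keeps the CSV easy to parse, but assumes that no string
    # value in the data contains a comma.
    return (
        "library(foreign)\n"
        f'df <- read.spss("{sav}", to.data.frame = TRUE)\n'
        f'write.csv(df, "{csv}", row.names = FALSE, quote = FALSE)\n'
    )


def sav_to_recarray(sav_path, csv_path):
    """Convert sav_path to CSV through R, then load the CSV as a NumPy recarray."""
    # Imported here so this module can be loaded without rpy2 or NumPy installed.
    import numpy as np
    import rpy2.robjects as robjects

    # Evaluate the generated R code in the embedded R session.
    robjects.r(build_r_code(sav_path, csv_path))
    # names=True takes field names from the header row; dtype=None infers column types.
    data = np.genfromtxt(csv_path, delimiter=",", names=True, dtype=None, encoding="utf-8")
    return data.view(np.recarray)
```

Calling `sav_to_recarray("survey.sav", "survey.csv")` (both file names hypothetical) would then give attribute-style access to the columns, which is what the recarray format is for.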
SPSS has extensive Python integration, but it is meant to be used from within SPSS (now known as IBM SPSS Statistics). There is also an SPSS ODBC driver that could be used with Python's ODBC support to read a .sav file.
gretl claims to import SPSS and export in a variety of formats, as does the R statistical suite. I've never dealt with SPSS data so cannot speak to their relative merits.
You could have Python make an external call to spssread, a Perl script that outputs the content of SPSS files in the way you want.
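A rough sketch of the external-call approach. The thread does not give spssread's command-line options, so the command is left to the caller, and the tab-delimited output is an assumption to adjust. `read_external_table` is a made-up helper name.

```python
import csv
import io
import subprocess


def read_external_table(command, delimiter="\t"):
    """Run an external command and parse its delimited stdout into rows of strings."""
    # check=True raises CalledProcessError if the command exits with a non-zero status.
    result = subprocess.run(command, capture_output=True, text=True, check=True)
    return list(csv.reader(io.StringIO(result.stdout), delimiter=delimiter))


# Hypothetical invocation; the script name and arguments depend on your spssread install:
# rows = read_external_table(["perl", "spssread.pl", "survey.sav"])
```

Every value comes back as a string, so numeric columns still need converting before they go into a NumPy array.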
Maybe this will help: Python reader + writer for spss sav files (Linux, Mac & Windows) http://code.activestate.com/recipes/577811-python-reader-writer-for-spss-sav-files-linux-mac-/
Maybe this will be helpful for someone:
http://sourceforge.net/search/?q=python+SPSS
Good luck!
Michal