15

When using scipy.optimize's fmin I'm getting an error I don't understand:

ValueError: setting an array element with a sequence.

Here's a simple squared error example to demonstrate:

import numpy as np
from scipy.optimize import fmin

def cost_function(theta, X, y):    
    m = X.shape[0]
    error = X.dot(theta) - y 
    J = 1/(2*m) * error.T.dot(error)  
    return J

X = np.array([[1., 1.],
              [1., 2.],
              [1., 3.],
              [1., 4.]])

y = np.array([[2],[4],[6],[8]])   
initial_theta = np.ones((X.shape[1], 1)) * 0.01

# test cost_function
print cost_function(initial_theta, X, y)
# [[ 14.800675]] seems okay...

# but then error here...   
theta = fmin(cost_function, initial_theta, args=(X, y))

#Traceback (most recent call last):
#  File "C:\Users\me\test.py", line 21, in <module>
#    theta = fmin(cost_function, initial_theta, args=(X, y))
#  File "C:\Python27\lib\site-packages\scipy\optimize\optimize.py", line 278, in fmin
#    fsim[0] = func(x0)
#ValueError: setting an array element with a sequence.

I'd be grateful for any help to explain where I'm going wrong.

Kim
  • 3,316
  • 5
  • 25
  • 28
  • 4
    You're returning an array for the cost function, and `fmin` expects a single value. Either return `J[0,0]` or rewrite your cost function as `J = 1/(2*m) * np.sum(error**2)` – Joe Kington Mar 20 '12 at 00:43

2 Answers2

9

The reason is that the starting point (initial_theta) you gave to fmin is not a 1D array but a 2D array. So on a second iteration fmin passes a 1D array (that's how it supposed to work) and the result becomes non-scalar.

So you should refactor your cost function to accept 1d arrays as a first argument.

The simplest change is to make the code working is to flatten the initial_theta before passing to fmin and reshape theta inside cost_function to (X.shape[1],1) if you like.

sega_sai
  • 8,328
  • 1
  • 29
  • 38
  • Thanks for the explanation. Yes, the number of dimensions was confusing me. So, I've now initially set both theta and y to one dim, and the cost function return value works out as a scalar without any further adjustment. Cheers. – Kim Mar 21 '12 at 17:53
7

cost_function should return a scalar, but your return value J is an array of some kind.

YXD
  • 31,741
  • 15
  • 75
  • 115