I'm working on a machine learning problem in which I've a multi-label target where each label is a probability. In the past I've worked with multi-label problems, but each label was binary. For example, if the target was a series of book topics ('Python', 'Machine Learning', 'Fairy Tales', 'Cooking') a Machine Learning book based on Python's scikit learn would have a target of [1, 1, 0, 0].
Now, for the problem I'm trying to solve, my target are not binary. The target itself is a series of probabilities like [0.75, 0.25, 0, 0]. I think the target was produced in a crowd sourcing fashion, and these probabilities reflect the variability of people's judgment. So, unless I want to bucket probabilities in classes (i.e p<0.5 ->0, p>=0.5 ->1), I'm stuck with a regression problem where the target needs to be constrained between 0 and 1. Any ideas of what type of algorithm I could try? I'm using Python's scikit learn.
Thanks!