This is a loaded question and it's my first 'real life' machine learning experiment so bare with the simplistic questions.
I have USPTO bulk data that looks like this in a CSV file:
Name Class Subclass Category Subcategory
Lightpack circuitboard E 1 4 9
Lego blocks F 2 56 12
D/C connector E 3 4 1
Colorful dog hat D 6 10 1
Grandma's shoes D 2 11 1
Low temp resistor O 2 4 10
What I want is to be able to run a supervised machine learning environment to group the common objects (there are many more than this in my actual data but this is a simple example). I want to be able to run through to find a common set of class, subclass, category, and subcategory amongst all electronics and to group them as such into an electronics 'bin' (ie: Lightpack circuitboard, D/C connector, and Low temp resistor) but am unsure how to proceed.
Currently I'm using Python and sklearn to do my more simplistic modeling but am unsure of how to test and train under 4 parameters given and I have no labeled set to compare to (no validation).
Would creating a pseudo-labeled set to make it supervised be more advised or is there an unsupervised approach I could take? As I said before this is my first real test in ML.