Questions tagged [recommendation-engine]

For questions relating to recommendation engines, collaborative filtering, and personalization. Questions tend to be algorithmic or statistical in nature.

Recommendation engines are systems that suggest (recommend) to users various items for consideration. Some of the best known real world usage occurs at many major internet sites, such as Netflix, Amazon, Google, Yahoo, Pandora, last.fm, and others.

The input of such algorithms are most commonly the past purchases of the user, the rating the user gave to other goods, the items that have been purchased together ("customer who bought this also bought..."), the browsing history, etc.

For this tag, it is recommended that users focus on mathematical or statistical clarity, as it is a particularly advanced topic and implementation can take quite a bit of computational effort.

See also wikipedia.

1468 questions
39
votes
5 answers

How to create my own recommendation engine?

I am interested in recommendation engines these days and I want to improve myself in this area. I am currently reading "Programming Collective Intelligence" I think this is the best book about this subject, from O'Reilly. But I don't have any ideas…
Burak Dede
  • 3,725
  • 5
  • 40
  • 53
29
votes
9 answers

How do recommendation systems work?

I've always been curious as to how these systems work. For example, how do netflix or Amazon determine what recommendations to make based on past purchases and/or ratings? Are there any algorithms to read up on? Just so there's no misperceptions…
Jason Baker
  • 192,085
  • 135
  • 376
  • 510
28
votes
2 answers

What's difference between item-based and content-based collaborative filtering?

I am puzzled about what the item-based recommendation is, as described in the book "Mahout in Action". There is the algorithm in the book: for every item i that u has no preference for yet for every item j that u has a preference for compute a…
cstur4
  • 966
  • 2
  • 8
  • 21
27
votes
2 answers

Using machine learning to de-duplicate data

I have the following problem and was thinking I could use machine learning but I'm not completely certain it will work for my use case. I have a data set of around a hundred million records containing customer data including names, addresses,…
26
votes
7 answers

How to train a classifier with only positive and neutral data?

My question : How to train a classifier with only positive and neutral data? I am building a personalized article recommendation system for education purposes. The data I use is from Instapaper. Datasets I only have positive data: - Articles that I…
log0
  • 541
  • 2
  • 7
  • 11
23
votes
1 answer

Recall, Recall rate@k and precision in top-k recommendation

According to authors in 1, 2, and 3, Recall is the percentage of relevant items selected out of all the relevant items in the repository, while Precision is the percentage of relevant items out of those items selected by the query. Therefore,…
19
votes
1 answer

How to evaluate a Content-based Recommender System

I'm building a content-based movie recommender system. It's simple, just let a user enter a movie title and the system will find a movie which has the most similar features. After calculating similarity and sorting the scores in descending order, I…
user691223
  • 321
  • 1
  • 3
  • 8
19
votes
1 answer

Understanding Precision@K, AP@K, MAP@K

I'm currently evaluating a recommender system based on implicit feedback. I've been a bit confused with regard to the evaluation metrics for ranking tasks. Specifically, I am looking to evaluate by both precision and recall. Precision@k has the…
apgsov
  • 794
  • 1
  • 8
  • 30
19
votes
4 answers

What should be considered when building a Recommendation Engine?

I've read the book Programming Collective Intelligence and found it fascinating. I'd recently heard about a challenge amazon had posted to the world to come up with a better recommendation engine for their system. The winner apparently produced the…
Allain Lalonde
  • 91,574
  • 70
  • 187
  • 238
19
votes
5 answers

Tag/Keyword based recommendation

I am wondering what algorithm would be clever to use for a tag driven e-commerce enviroment: Each item has several tags. IE: Item name: "Metallica - Black Album CD", Tags: "metallica", "black-album", "rock", "music" Each user has several tags and…
Hellnar
  • 62,315
  • 79
  • 204
  • 279
18
votes
3 answers

Dealing with lots of data in Firebase for a recommender system

I am building a recommender system where I use Firebase to store and retrieve data about movies and user preferences. Each movie can have several attributes, and the data looks as follows: { "titanic": {"1997": 1, "english": 1, "dicaprio":…
18
votes
4 answers

SOLR and Natural Language Parsing - Can I use it?

Requirements Word frequency algorithm for natural language processing Using Solr While the answer for that question is excellent, I was wondering if I could make use of all the time I spent getting to know SOLR for my NLP. I thought of SOLR…
andy
  • 8,775
  • 13
  • 77
  • 122
17
votes
3 answers

A good collaborative filtering/matching/recommendation library for Python/Django?

I'm looking for a library I can use to match my users to other Django models based on answers to questions-- also my own django model. So I'd like something customizable, with good documentation/support, and hopefully not too hard to implement! Does…
Colleen
  • 23,899
  • 12
  • 45
  • 75
17
votes
4 answers

How can I implement a recommendation engine?

Please be patient with my writing, as my English is not proficient. As a programmer, I wanna learn about the algorithm, or the machine learning intelligence, that are implemented underneath recommendation systems or related-based systems. For…
HTa
  • 171
  • 1
  • 3
17
votes
4 answers

Why is the Netflix Prize so challenging?

Having just read the recent article in Wired, I'm curious: what is it about the Netflix Prize that's so challenging? I mean this in the sincerest way possible, I'm just curious about the difficulties posed by the contest. Are most recommendation…
Alex Basson
  • 10,967
  • 6
  • 29
  • 44
1
2 3
97 98