Beginner’s Guide To Building A Song Recommender In Python

The number of songs available exceeds what any individual could listen to in a lifetime. Choosing from millions of songs can be tedious, and there is a good chance of missing out on songs that could have become favourites.

Music service providers like Spotify need an efficient way to manage songs and help their customers discover music through quality recommendations. To build such a recommendation system, they deploy machine learning algorithms that process data from a million sources and present the listener with the most relevant songs.

There are mainly three types of recommendation system: content-based, collaborative and popularity-based.

A content-based system predicts what a user will like based on what that user liked in the past. A collaborative system predicts what a particular user will like based on what other, similar users like.

The problem with a popularity-based recommendation system is that it offers no personalisation: even if the behaviour of the user is known, a personalised recommendation cannot be made.

Here we illustrate a naive popularity based approach and a more customised one using Python:

# Importing essential libraries #

import pandas as pd

from sklearn.model_selection import train_test_split

import numpy as np

import time

# joblib is now a standalone package; sklearn.externals.joblib was removed from recent scikit-learn releases

import joblib

# Recommenders and Evaluation are local modules: download these files into your source code directory #

import Recommenders as Recommenders

import Evaluation as Evaluation

#The following lines will download the data directly#

triplets_file = 'https://static.turi.com/datasets/millionsong/10000.txt'

songs_metadata_file = 'https://static.turi.com/datasets/millionsong/song_data.csv'

song_df_1 = pd.read_csv(triplets_file, header=None, sep = "\t")

#in the above line the separator is a TAB hence \t otherwise the file is read as single column#

song_df_1.columns = ['user_id', 'song_id', 'listen_count']

print(song_df_1)

#Read song metadata

song_df_2 = pd.read_csv(songs_metadata_file)

#Merge the two dataframes

song_df = pd.merge(song_df_1, song_df_2.drop_duplicates(['song_id']), on="song_id", how="left")

song_df.head()

len(song_df)

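#CREATING A SUBSET FROM THE DATASET#

# .copy() keeps the later column assignment from triggering a pandas SettingWithCopyWarning

song_df = song_df.head(10000).copy()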

#Merge song title and artist_name columns to make a merged column

song_df['song'] = song_df['title'].map(str) + " - " + song_df['artist_name']

song_grouped = song_df.groupby(['song']).agg({'listen_count': 'count'}).reset_index()

grouped_sum = song_grouped['listen_count'].sum()

song_grouped['percentage'] = song_grouped['listen_count'].div(grouped_sum)*100

song_grouped.sort_values(['listen_count', 'song'], ascending = [0,1])

# TRAINING AND TESTING THE DATA#

train_data, test_data = train_test_split(song_df, test_size = 0.20, random_state=0)

print(train_data.head(5))

#CREATING AN INSTANCE BASED ON POPULARITY#

pm = Recommenders.popularity_recommender_py()

pm.create(train_data, 'user_id', 'song')

#PREDICTING#

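# users is never defined in this listing; taking the unique user ids from song_df is an assumption

users = song_df['user_id'].unique()

user_id = users[5]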

pm.recommend(user_id)

#CREATING A CLASS FOR SONG SIMILARITY#

is_model = Recommenders.item_similarity_recommender_py()

is_model.create(train_data, 'user_id', 'song')

#RECOMMENDATION#

user_id = users[9]

user_items = is_model.get_user_items(user_id)

for user_item in user_items:
    print(user_item)

#GET SIMILAR SONGS#

# The 'song' column is built with " - " (a plain hyphen), so the lookup string must use one too

song = 'Yellow - Coldplay'

is_model.get_similar_items([song])

Here, 20% is picked arbitrarily as the testing size. A popularity-based recommender class is used as a black box to train the model. We create an instance of the popularity-based recommender class and feed it our training data.
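As a rough check on what the 80/20 split does, the sketch below applies the same train_test_split call to a small, made-up play log. The toy_df frame and its values are assumptions for illustration only; they are not taken from the Million Song data.

```python
import pandas as pd
from sklearn.model_selection import train_test_split

# Hypothetical toy play log; the column names mirror the article's song_df.
toy_df = pd.DataFrame({
    "user_id": ["u1", "u1", "u2", "u2", "u3", "u3", "u4", "u4", "u5", "u5"],
    "song": ["A", "B", "A", "C", "B", "C", "A", "D", "B", "D"],
    "listen_count": [1, 2, 1, 5, 3, 1, 2, 2, 1, 4],
})

# test_size=0.20 holds out 20% of the rows; random_state fixes the shuffle
# so the same rows land in each set on every run.
toy_train, toy_test = train_test_split(toy_df, test_size=0.20, random_state=0)

print(len(toy_train), len(toy_test))
print(sorted(toy_test.index))
```

Note that the split is made over rows, not users, so the same user can appear in both the training and the test set.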

train_data, test_data = train_test_split(song_df, test_size = 0.20, random_state=0)

print(train_data.head(5))

pm = Recommenders.popularity_recommender_py()

pm.create(train_data, 'user_id', 'song')

user_id = users[9]

pm.recommend(user_id)

Even if we change the user, the system returns the same result, since it is a popularity-based recommendation system.
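To see why the user makes no difference, here is a minimal sketch of a popularity ranking written directly in pandas. It is not the code of the Recommenders module; the helper names build_popularity_table and recommend, and the toy data, are hypothetical. It assumes that popularity means the number of distinct users who played a song.

```python
import pandas as pd

# Hypothetical toy play log; column names mirror the article's song_df.
toy_plays = pd.DataFrame({
    "user_id": ["u1", "u1", "u2", "u2", "u3", "u3", "u4"],
    "song":    ["A",  "B",  "A",  "C",  "A",  "B",  "C"],
})

def build_popularity_table(train_df, user_col, item_col):
    """Score each item by the number of distinct users who played it."""
    table = (
        train_df.groupby(item_col)[user_col]
        .nunique()
        .reset_index(name="score")
        .sort_values(["score", item_col], ascending=[False, True])
        .reset_index(drop=True)
    )
    table["rank"] = table.index + 1
    return table

def recommend(table, user_id, top_n=2):
    """Return the top rows; user_id is only a label and never affects ranking."""
    out = table.head(top_n).copy()
    out.insert(0, "user_id", user_id)
    return out

popularity_table = build_popularity_table(toy_plays, "user_id", "song")
recs_u1 = recommend(popularity_table, "u1")
recs_u4 = recommend(popularity_table, "u4")

print(recs_u1)
print(recs_u4)
```

The ranking is computed once from the training data, and the user id is only attached to the output, so every user receives the same list of songs.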

This is a naive approach and not many insights can be drawn from this. To make a more personalised recommender system, item similarity can be considered.

Item Similarity Based Personalized Recommender

Memory-based filtering consists of two main methods:

User-item filtering: “Users who are similar to you also liked…”
Item-item filtering: “Users who liked the item you liked also liked…”
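The two views can be sketched on a small play matrix. The example below is illustrative only: the toy users, songs and plays are made up, and raw overlap counts stand in for whatever similarity measure a real system would use.

```python
import numpy as np

toy_users = ["u1", "u2", "u3"]
toy_songs = ["A", "B", "C", "D"]

# Rows are users, columns are songs; 1 means the user played the song.
plays = np.array([
    [1, 1, 0, 0],   # u1 played A, B
    [1, 1, 1, 0],   # u2 played A, B, C
    [0, 0, 1, 1],   # u3 played C, D
])

# User-user overlap: entry (i, j) counts the songs both users played.
user_overlap = plays @ plays.T

# Item-item overlap: entry (i, j) counts the users who played both songs.
item_overlap = plays.T @ plays

print(user_overlap)
print(item_overlap)
```

User-item filtering would look along a row of user_overlap to find listeners similar to you, while item-item filtering would look along a row of item_overlap to find songs that share listeners with one you played.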

Most companies, like Netflix, use a hybrid approach, which provides recommendations based on a combination of what content a user liked in the past and what other similar users like.

#Personalised System Part II

#Creating an instance of item similarity based recommender class

is_model = Recommenders.item_similarity_recommender_py()

is_model.create(train_data, 'user_id', 'song')

#Use the personalized model to make some song recommendations

#Print the songs for the user in training data

user_id = users[9]

user_items = is_model.get_user_items(user_id)

for user_item in user_items:
    print(user_item)

#Recommend songs for the user using personalized model

is_model.recommend(user_id)

is_model.get_similar_items(['Mr Sandman - The Chordettes'])

song = 'Yellow - Coldplay'

is_model.get_similar_items([song])

In item similarity, the main method is “generate_top_recommendation”. This method creates a co-occurrence matrix, which can be thought of as a set of data items containing user preferences.

A snippet of code from the file

Here, songs are the items. We calculate a weighted average of the scores in the co-occurrence matrix across all of the user’s songs. The indices are then sorted by their value and the corresponding score.
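The scoring step described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the code in the Recommenders file: the listeners dictionary is made up, Jaccard overlap between listener sets is assumed as the co-occurrence score, and every one of the user's songs is given equal weight in the average.

```python
import numpy as np

# Hypothetical toy data: the set of users who played each song.
listeners = {
    "A": {"u1", "u2", "u3"},
    "B": {"u1", "u2"},
    "C": {"u2", "u3"},
    "D": {"u4"},
}
all_songs = sorted(listeners)
user_songs = ["A", "B"]   # songs the target user has already played

def jaccard(a, b):
    """Shared listeners divided by total distinct listeners."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

# Rows: the user's songs. Columns: every song in the catalogue.
cooccurrence = np.array([
    [jaccard(listeners[s], listeners[t]) for t in all_songs]
    for s in user_songs
])

# Average each column over the user's songs (equal weights in this sketch).
scores = cooccurrence.mean(axis=0)

# Sort indices by score, highest first, and skip songs already played.
order = np.argsort(-scores, kind="stable")
recommendations = [
    (all_songs[i], float(scores[i]))
    for i in order
    if all_songs[i] not in user_songs
]

print(recommendations)
```

Song C shares listeners with both A and B, so it ranks above D, which shares none.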

is_model = Recommenders.item_similarity_recommender_py()

is_model.create(train_data, 'user_id', 'song')

# this prints training data

user_id = users[5]

user_items = is_model.get_user_items(user_id)

for user_item in user_items:
    print(user_item)

is_model.recommend(user_id)

Output:

The output consists of user_id and its corresponding song name.

This article is an attempt to give beginners a guide to implementing a simple song recommender, along with a brief account of how to run the source code for a simple application, so that it can be taken further and experimented with.

Check the full notebook here.

The post Beginner’s Guide To Building A Song Recommender In Python appeared first on Analytics India Magazine.
