Building a Python Script That Clusters Photos Based on the Faces in Them

Used Google Photos before? You’ve probably seen this in action.

Divakar Rajesh
The Startup

--

So, what are we building? A script that takes a folder of photos and sorts them into clusters, one per person, using the face_recognition module.

Let’s get familiar with the module first

The face_recognition module gives us functions to load a photo, get the encodings of the faces in it, and compare two encodings to tell whether they belong to the same person.

We’ll use two photos of Kylie (one with black hair and another with blonde hair) and one photo of Khloe.
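Here’s a minimal sketch of that comparison, assuming the three photos are saved as kylie_black.jpg, kylie_blonde.jpg and khloe.jpg (the file names are placeholders for whatever you downloaded):

```python
import face_recognition

def first_face_encoding(path):
    # Load the photo and return the 128-dimensional encoding
    # of the first face found in it.
    image = face_recognition.load_image_file(path)
    return face_recognition.face_encodings(image)[0]

kylie_black = first_face_encoding("kylie_black.jpg")
kylie_blonde = first_face_encoding("kylie_blonde.jpg")
khloe = first_face_encoding("khloe.jpg")

# compare_faces returns one boolean per known encoding.
print(face_recognition.compare_faces([kylie_black], kylie_blonde))  # [True]
print(face_recognition.compare_faces([kylie_black], khloe))         # [False]
```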

Running the program, the two Kylie photos are reported as the same person and Khloe as a different one. Even though Kylie has different hair colors, only her face data is used to decide if they are the same person. Nice!

Let’s cluster a bunch of images now, shall we?

A sample set of 25 images of 3 football players is in the GitHub repo mentioned above, so you can follow along!

1. Let’s try to get all the photos that we have in the “dataset” directory
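One simple way to do this (the extension list is an assumption; adjust it to match your dataset):

```python
import os

DATASET_DIR = "dataset"
IMAGE_EXTENSIONS = (".jpg", ".jpeg", ".png")

# Every image file sitting directly inside the "dataset" directory.
photo_paths = [
    os.path.join(DATASET_DIR, name)
    for name in sorted(os.listdir(DATASET_DIR))
    if name.lower().endswith(IMAGE_EXTENSIONS)
]
```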

2. Now we process each image, checking whether the face in it matches any cluster we already have. If it matches, we add the image to that cluster; if it doesn’t, we create a new cluster
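A sketch of that loop. The clusters dictionary is a helper I’m adding here so we remember which photos landed where; the script keeps an encodings dictionary of face data per cluster, which this mirrors:

```python
import face_recognition

encodings = {}  # cluster id -> list of face encodings in that cluster
clusters = {}   # cluster id -> list of photo paths (helper for step 3)
next_cluster_id = 0

for path in photo_paths:
    faces = face_recognition.face_encodings(
        face_recognition.load_image_file(path))
    if not faces:
        continue     # no face detected in this photo, skip it
    face = faces[0]  # we only consider the first face (see caveat below)

    # Look for an existing cluster whose faces match this one.
    matched_id = next(
        (cid for cid, known in encodings.items()
         if any(face_recognition.compare_faces(known, face))),
        None,
    )

    if matched_id is None:  # no match anywhere: start a new cluster
        matched_id = next_cluster_id
        next_cluster_id += 1
        encodings[matched_id] = []
        clusters[matched_id] = []

    encodings[matched_id].append(face)
    clusters[matched_id].append(path)
```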

3. Putting this all together and running the file should group the images into a “results” folder in the current working directory, with a subfolder for each cluster
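The last step just copies the files, for example:

```python
import os
import shutil

# Copy each photo into results/<cluster id>/ so every person
# ends up in their own folder.
for cluster_id, paths in clusters.items():
    folder = os.path.join("results", str(cluster_id))
    os.makedirs(folder, exist_ok=True)
    for path in paths:
        shutil.copy(path, folder)
```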

(The complete code can be found in the GitHub Repo)

Yay! We can now cluster images based on who’s in them. But wait a second, this CLI app won’t scale well if you are trying to build, say, a desktop GUI or Android application around it. Some things you might want to consider include:

  1. Representative image for a cluster: Promote one image to represent each cluster. Then, every time we check whether an image belongs to a cluster, we compare it with just that one representative image and not all the images the cluster has
  2. Parameter tuning: Of course, the library’s out-of-the-box defaults will not suit all use cases. We might want to tune parameters such as tolerance to get better results (see the sketch after this list)
  3. Multiple faces: If you followed along, you might also have noticed that we only take the first face we find in an image; real photos often contain more than one (also covered in the sketch below)
  4. Database and OOMs: Throw in a SQLite database or similar and persist the data on disk, rather than holding every encoding in memory at once like the “encodings” dictionary in our script does
  5. Same or different: One nice thing I also like about Google Photos is that when it realizes some clusters resemble each other closely, it suggests “Same or different person?” and asks for human input.
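For items 2 and 3, here’s a hypothetical helper showing how both ideas might look. The name faces_matching_cluster and the 0.5 value are my own; the library’s default tolerance is 0.6:

```python
import face_recognition

# Lower tolerance = stricter matching: fewer false merges, but the
# same person may split into more clusters. Tune per dataset.
TOLERANCE = 0.5

def faces_matching_cluster(path, cluster_encodings):
    """Return every face encoding in the photo at `path` that matches
    the given cluster, instead of only looking at the first face."""
    image = face_recognition.load_image_file(path)
    return [
        face
        for face in face_recognition.face_encodings(image)
        if any(face_recognition.compare_faces(
            cluster_encodings, face, tolerance=TOLERANCE))
    ]
```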

So, that’s it? Wait, we have something interesting to share!

Sensara’s Video Reasoning platform

We at Sensara have been building something interesting that helps identify what’s in a video. This is especially useful for an OTT platform, where recommendations and user interests can be improved manifold if we can understand what’s in the videos that users watch.

Banners, Trailers, Boilerplates, Tunes, Detail pages: we’ve created them all, just from the given video.

We mine this almost in real-time from linear TV as well.

Of course, that is built with a more sophisticated solution than the above script, duh! 🤭

And that is used to power recommendations and detail pages on popular D2H boxes, including Airtel’s. This helps us build enriched detail pages for people, with the movies they are in, links to relevant OTT apps, and future TV shows they appear on.

You can also see it in action in the “Sensy India TV Guide & Remote” and “Mi Remote controller” Android apps.

Hey 👋, I’m Divakar Rajesh, a Product Engineer at Sensara. You’ve probably interacted with our products if you have tried the Xiaomi Mi Remote. We’re also the default TV Guide/Launcher on some popular smart TVs, including the Mi TV, and we power recommendations and live TV discovery on all Airtel smart set-top boxes.

On a personal note 😛, you can find me on Twitter and other socials as @sdivakarrajesh. Shameless plug 🤦‍♂️. See ya!
