Skip to content

A python script to find and delete (optionally) deleted photos using MD5 Hash.

License

Notifications You must be signed in to change notification settings

vybhav72954/Duplicate_Images

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Duplicate Image Finder

forthebadge made-with-python

Many a times, we find duplicate images residing in our Albums, Image_directory etc, there ane various reasons, downloading same file form various sources, auto backup on cloud, it slipped out of our mind that we downloaded it already in the first place etc. Manually selecting them is actually a hassle, but why do such boring task when automation can do the trick. This sweet and simple script, helps you to compare various files (not only images) in a directory, find the duplicate, list them out, and then even allows you to delete them. Sweet!!!

Setup

  • Setup a python 3.x virtual environment.
  • Activate the environment
  • Install the dependencies using pip3 install -r requiremnts.txt
  • You are all set and the script is Ready to run.
  • Clearly Follow the Instructions provided in the comments.

Usage

In Command Line Interface, Run the script using -

python image_finder.py <path of folder1, path of folder2, .....>

       1. folder1 - Parent Folder

       2. folder2, folder3 .... - Subsequent Folders

Comparisons are done with in the folder, and from Parent to Subsequent Folders.

Dependencies

  1. python3
  2. keyboard

Detailed explanation

The Script works on a simple fundamental. Two files with same md5checksum will have similar contents. So in the script all we aim to do is determine the checksum, compare and find the duplicates.

Folder Structure

  • Stand_Alone folder has 6 images, 2 of them are duplicate of images within the folder only.
  • Parent contains standard images used for Image Processing in png format.
  • Duplicate folder contains 5 images duplicate of images in Stand_Alone (named Random Name (n)). There are similar images in tiff extension as well, They are not Duplicate as file type is different.
  • Duplicate_1 folder contains another 5 images duplicate of images in Stand_Alone (named Another Random Name (n)). There are similar images in jpeg extension as well, They are not Duplicate as file type is different.

Output

  • Running Script on a single folder Stand_Alone. In this example I pressed [n] in order to not to delete anything.

Pasting the Magnet Link

  • Stand_Alone folder Before Deleting the files.

Pasting the Magnet Link

  • After Deleting the Files, i.e. Pressing [y] at the prompt.

Pasting the Magnet Link

  • Parent, Duplicate, Duplicate_1 folder before running the script.

Pasting the Magnet Link

  • Running the scripts on the Folder and deleting the duplicate files.

Pasting the Magnet Link

Pasting the Magnet Link

  • Final Result, Notice that all the files in Parent Folder remain as it is.

Pasting the Magnet Link

Author(s)

Made by Vybhav Chaturvedi

Check Rotten Scripts for more such awesome Scripts and useful codes.

About

A python script to find and delete (optionally) deleted photos using MD5 Hash.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages