De-duplicating Collections

Using a filter to hide duplicate files is an optional method of managing the amount of files surveyed.

Running a De-duplication Job

Select Additional Analysis from the Evidence menu.
Select Flag Duplicate Files from the File Hashes group and click OK.

This will start a comparison of the checksums of all files in the collection.

When complete close the Data Processing window.

Adding a Duplicate File Column Set to a FTK Case

Click the column settings button from the File List window.
Click the import button and navigate to Storage(F:)\FTKsettings\ColumnDefs\Duplicates.xml
Click OK and close the column settings window.
Select Duplicates from the column dropdown.

Reading the Duplicate File Field

Note the Duplicate File field in the File List.
Note the number in the Duplicate File field.

If the Duplicate File field is blank the file has not been analyzed for duplication.
1 indicates the file is duplicated but it is the first instance of that file hashed in the case.
2 indicates the file is a duplicated and not the first instance of that file hashed in the case.
3 indicates the file is unique and does not have a duplicate in the collection.

Adding a Duplicate Filter

Click the import filter button from the Filter Manager
Navigate to Storage(F:)\FTKsettings\FilterDefs\DuplicateSecondary.xml
Click open and OK to import it into the case.
Use the filter to either include or exclude secondary duplicates from the File List.

Example: File list excluding secondary duplicates.

Example: File list including only secondary data sets.