De-duplicating Collections
Using a filter to hide duplicate files is an optional method of managing the amount of files surveyed.
Running a De-duplication Job
-
Select Additional Analysis from the Evidence menu.
-
Select Flag Duplicate Files from the File Hashes group and click OK.
- This will start a comparison of the checksums of all files in the collection.
- When complete close the Data Processing window.
Adding a Duplicate File Column Set to a FTK Case
- Click the column settings button from the File List window.
- Click the import button and navigate to Storage(F:)\FTKsettings\ColumnDefs\Duplicates.xml
-
Click OK and close the column settings window.
- Select Duplicates from the column dropdown.
Reading the Duplicate File Field
- Note the Duplicate File field in the File List.
- Note the number in the Duplicate File field.
-
If the Duplicate File field is blank the file has not been analyzed for duplication.
- 1 indicates the file is duplicated but it is the first instance of that file hashed in the case.
- 2 indicates the file is a duplicated and not the first instance of that file hashed in the case.
- 3 indicates the file is unique and does not have a duplicate in the collection.
Adding a Duplicate Filter
- Click the import filter button from the Filter Manager
- Navigate to Storage(F:)\FTKsettings\FilterDefs\DuplicateSecondary.xml
- Click open and OK to import it into the case.
- Use the filter to either include or exclude secondary duplicates from the File List.
Example: File list excluding secondary duplicates.
Example: File list including only secondary data sets.