De-duplicating Collections

Using a filter to hide duplicate files is an optional way to reduce the number of files that must be reviewed.

Running a De-duplication Job

  • Select Additional Analysis from the Evidence menu.

  • Select Flag Duplicate Files from the File Hashes group and click OK.

  • This will start a comparison of the checksums of all files in the collection.

  • When the job is complete, close the Data Processing window.
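
The checksum comparison above can be illustrated outside FTK with a minimal sketch. It is not FTK's implementation: the SHA-256 algorithm, the function names group_by_checksum and duplicate_groups, and the (name, content) input shape are all assumptions chosen for illustration. Files whose contents produce the same digest are treated as duplicates of one another.

```python
import hashlib
from collections import defaultdict


def group_by_checksum(files):
    """Group file names by the SHA-256 digest of their content.

    `files` is an iterable of (name, content_bytes) pairs. This input
    shape is hypothetical; a real job would read the content from disk.
    """
    groups = defaultdict(list)
    for name, data in files:
        digest = hashlib.sha256(data).hexdigest()
        groups[digest].append(name)
    return groups


def duplicate_groups(files):
    """Return only the groups that contain more than one file."""
    return [names for names in group_by_checksum(files).values() if len(names) > 1]


# Illustrative data only: two files share identical content.
sample = [
    ("a.txt", b"hello"),
    ("b.txt", b"world"),
    ("copy_of_a.txt", b"hello"),
]
print(duplicate_groups(sample))
```

Comparing digests rather than file contents directly keeps the comparison cheap, since each file is read once and only short hash strings are matched.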

Adding a Duplicate File Column Set to an FTK Case

  • Click the column settings button in the File List window.
  • Click the import button and navigate to Storage(F:)\FTKsettings\ColumnDefs\Duplicates.xml
  • Click OK and close the column settings window.

  • Select Duplicates from the column dropdown.

Reading the Duplicate File Field

  • Locate the Duplicate File field in the File List and note the number it contains.

  • If the Duplicate File field is blank, the file has not been analyzed for duplication.

  • 1 indicates the file is duplicated, but it is the first instance of that file hashed in the case.
  • 2 indicates the file is a duplicate and not the first instance of that file hashed in the case.
  • 3 indicates the file is unique and does not have a duplicate in the collection.
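
The three values can be reproduced with a short sketch that assigns a code to each file from its hash and the order in which it was hashed. This only mirrors the meanings listed above; the function name assign_duplicate_flags, the constant names, and the (name, digest) input shape are hypothetical and do not come from FTK.

```python
from collections import Counter

# Codes as described in the list above; the constant names are hypothetical.
PRIMARY, SECONDARY, UNIQUE = 1, 2, 3


def assign_duplicate_flags(hashed_files):
    """Map each file name to a duplicate code.

    `hashed_files` is a list of (name, digest) pairs in the order the
    files were hashed (an assumed input shape).
    """
    counts = Counter(digest for _, digest in hashed_files)
    seen = set()
    flags = {}
    for name, digest in hashed_files:
        if counts[digest] == 1:
            flags[name] = UNIQUE      # no other file shares this hash
        elif digest in seen:
            flags[name] = SECONDARY   # a later instance of a duplicated file
        else:
            flags[name] = PRIMARY     # first instance of a duplicated file
            seen.add(digest)
    return flags


# Illustrative digests only.
example = [("report.doc", "aa11"), ("notes.txt", "bb22"), ("report_copy.doc", "aa11")]
print(assign_duplicate_flags(example))
```

A blank field has no counterpart here, because the sketch assumes every file passed in has already been hashed.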

Adding a Duplicate Filter

  • Click the import filter button in the Filter Manager.
  • Navigate to Storage(F:)\FTKsettings\FilterDefs\DuplicateSecondary.xml
  • Click Open, then OK, to import the filter into the case.
  • Use the filter to either include or exclude secondary duplicates from the File List.
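
The effect of including or excluding secondary duplicates can be sketched as a simple list filter. This is an illustration of the logic only, not the contents of DuplicateSecondary.xml: the function name apply_duplicate_filter, the mode argument, and the row dictionaries with "name" and "duplicate_file" keys are all assumptions.

```python
SECONDARY = 2  # Duplicate File value for a non-first instance of a duplicated file


def apply_duplicate_filter(rows, mode="exclude"):
    """Filter File List rows on the secondary-duplicate code.

    `rows` is a list of dicts with "name" and "duplicate_file" keys
    (a hypothetical stand-in for File List entries).
    mode="exclude" hides secondary duplicates; mode="include" shows only them.
    """
    if mode not in ("include", "exclude"):
        raise ValueError("mode must be 'include' or 'exclude'")
    keep_secondary = mode == "include"
    return [r for r in rows if (r["duplicate_file"] == SECONDARY) == keep_secondary]


# Illustrative rows only.
rows = [
    {"name": "report.doc", "duplicate_file": 1},
    {"name": "report_copy.doc", "duplicate_file": 2},
    {"name": "notes.txt", "duplicate_file": 3},
]
print([r["name"] for r in apply_duplicate_filter(rows, "exclude")])
print([r["name"] for r in apply_duplicate_filter(rows, "include")])
```

Excluding secondary duplicates leaves one copy of every file in view, which is why primary duplicates and unique files both pass the filter.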

Example: File List excluding secondary duplicates.

Example: File List including only secondary duplicates.