    Best posts made by HelloWill

    • How to validate data migration and dedupe files and video?

      We are in the process of consolidating all our data from multiple servers down to one. We have about 250 TB of data and have moved about 200 TB of it already, and I’m seeing that we have tons of duplicate files and folders.

      Our data is a mixture of RAW video (.R3D), pictures and general business data. Some of our data has alternate data streams (ADS). I need to make sure all our data transferred correctly and that all the metadata and ADS are intact.

      What’s the best way to easily verify that all the files are the same and then deduplicate? I’m looking for advice on tools and methods from those who’ve been there and done that, so I can make sure everything is good.
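
      To make the question concrete, here is a minimal sketch of the kind of check I mean, assuming both the source and destination trees are mounted on one machine. The names `file_digest` and `compare_trees` are made up for illustration, and SHA-256 is just one reasonable hash choice. Note the assumption that it only reads each file's main data stream, so it says nothing about ADS or other metadata.

```python
import hashlib
import os


def file_digest(path, chunk_size=1024 * 1024):
    """Return the SHA-256 hex digest of a file, read in 1 MiB chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def compare_trees(src_root, dst_root):
    """Compare each file under src_root with the same relative path under dst_root.

    Returns (missing, mismatched): two sorted lists of relative paths.
    Only the main data stream is checked (assumption: ADS needs a separate pass).
    """
    missing, mismatched = [], []
    for dirpath, _dirnames, filenames in os.walk(src_root):
        for name in filenames:
            src = os.path.join(dirpath, name)
            rel = os.path.relpath(src, src_root)
            dst = os.path.join(dst_root, rel)
            if not os.path.isfile(dst):
                missing.append(rel)
            elif os.path.getsize(src) != os.path.getsize(dst):
                mismatched.append(rel)  # cheap size check, no hashing needed
            elif file_digest(src) != file_digest(dst):
                mismatched.append(rel)
    return sorted(missing), sorted(mismatched)
```

      Calling `compare_trees(old_share, new_share)` with two hypothetical mount points would give a list of files that never arrived and a list whose contents differ.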

      posted in IT Discussion
      HelloWill
    • What's the Best Way to Deduplicate & Organize Files/Folders on a 200 TB NAS?

      We have a large FreeNAS server that is loaded with files. I am looking for advice on the best way to get things cleaned up, and I know there are tons of duplicates.

      File Types:

      • Images
      • Text
      • Videos

      File Counts:

      • 10,000,000+ Files
      • 200+ TB

      I've tried running many duplicate scanners, but none of them has been easy to work with: they crash when their logs get too big, it's hard to get context for the results, and it takes days to scan without checksums (checksumming files with MD5 takes a really long time). To top it off, they only run on one PC, so I can't even enlist the rest of the team to help clean up.

      I need a way to easily scan files, identify duplicates, and ideally save scan results and checksums so that we don't need to keep re-scanning the same files again and again. I like Beyond Compare, but it only helps after the duplicates have been identified.
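
      As a rough sketch of what I have in mind, assuming a SQLite file is an acceptable place to keep results: checksums are cached by path, size and modification time, so an unchanged file is never hashed twice, and only files that share a size with another file get hashed at all. The names `open_cache`, `cached_digest` and `find_duplicates` are hypothetical, and SHA-256 stands in for MD5 here.

```python
import hashlib
import os
import sqlite3
from collections import defaultdict


def open_cache(db_path):
    """Open (or create) the SQLite checksum cache."""
    conn = sqlite3.connect(db_path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS files ("
        "path TEXT PRIMARY KEY, size INTEGER, mtime_ns INTEGER, sha256 TEXT)"
    )
    return conn


def cached_digest(conn, path):
    """Return the file's SHA-256, reusing the cached value if size and mtime match."""
    st = os.stat(path)
    row = conn.execute(
        "SELECT sha256 FROM files WHERE path = ? AND size = ? AND mtime_ns = ?",
        (path, st.st_size, st.st_mtime_ns),
    ).fetchone()
    if row:
        return row[0]
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(1024 * 1024), b""):
            h.update(chunk)
    digest = h.hexdigest()
    conn.execute(
        "INSERT OR REPLACE INTO files VALUES (?, ?, ?, ?)",
        (path, st.st_size, st.st_mtime_ns, digest),
    )
    conn.commit()
    return digest


def find_duplicates(conn, root):
    """Return groups (sorted lists of paths) of files with identical content."""
    by_size = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            by_size[os.path.getsize(path)].append(path)
    by_content = defaultdict(list)
    for size, paths in by_size.items():
        if len(paths) < 2:
            continue  # a unique size cannot have a duplicate, so skip hashing
        for path in paths:
            by_content[(size, cached_digest(conn, path))].append(path)
    return [sorted(group) for group in by_content.values() if len(group) > 1]
```

      Because the cache is an ordinary database file, it could in principle sit somewhere the rest of the team can read, so scan results are not tied to one PC.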

      What do you guys do to scan this much data and make sense of it / organize it?

      posted in IT Discussion
      HelloWill