• Skip to primary navigation
  • Skip to main content
  • Skip to primary sidebar
  • Skip to footer navigation

The Geek Diary

  • OS
    • Linux
    • CentOS/RHEL
    • Solaris
    • Oracle Linux
    • VCS
  • Interview Questions
  • Database
    • oracle
    • oracle 12c
    • ASM
    • mysql
    • MariaDB
  • DevOps
    • Docker
    • Shell Scripting
  • Big Data
    • Hadoop
    • Cloudera
    • Hortonworks HDP

duperemove: command not found

by Deepika

duperemove utility finds duplicate filesystem extents and optionally schedule them for deduplication. An “extent” is a small part of a file that is stored within a filesystem. On some filesystems, one extent can be referenced multiple times when the contents of different files are identical. This is known as “extent sharing” or “deduplication” and it can save disk space by eliminating the need to store multiple copies of the same data.

The duperemove command can be used to find and remove these duplicate extents, which can help to free up disk space. It works by comparing the contents of files and identifying extents that are identical. Once duplicate extents are found, the command can be used to schedule them for deduplication, which will remove the duplicate extents and free up disk space.

If you encounter the below error while running the command duperemove:

duperemove: command not found

you may try installing the below package as per your choice of distribution:

Distribution Command
Debian apt-get install duperemove
Ubuntu apt-get install duperemove
Kali Linux apt-get install duperemove
Fedora dnf install duperemove

duperemove Command Examples

1. Search for duplicate extents in a directory and show them:

# duperemove -r path/to/directory

2. Deduplicate duplicate extents on a Btrfs or XFS (experimental) filesystem:

# duperemove -r -d path/to/directory

3. Use a hash file to store extent hashes (less memory usage and can be reused on subsequent runs):

# duperemove -r -d --hashfile=path/to/hashfile path/to/directory

4. Limit I/O threads (for hashing and dedupe stage) and CPU threads (for duplicate extent finding stage):

# duperemove -r -d --hashfile=path/to/hashfile --io-threads=N --cpu-threads=N path/to/directory

Filed Under: Linux

Some more articles you might also be interested in …

  1. bmaptool Command Examples (Create or copy block maps intelligently)
  2. How to use wget to download file via proxy
  3. bluetoothd Command Examples in Linux
  4. grub-bios-setup: command not found
  5. How to Change Default Port of Apache On RHEL/CentOS 7
  6. What are SELinux Modes and how to set them
  7. scriptreplay: command not found
  8. mesg Command Examples in Linux
  9. repo-add Command Examples in Linux
  10. i3lock Command Examples in Linux

You May Also Like

Primary Sidebar

Recent Posts

  • ctags: Generates an index (or tag) file of language objects found in source files for many popular programming languages
  • csvtool: Utility to filter and extract data from CSV formatted sources
  • csvstat: Print descriptive statistics for all columns in a CSV file
  • csvsql: Generate SQL statements for a CSV file or execute those statements directly on a database

© 2023 · The Geek Diary

  • Archives
  • Contact Us
  • Copyright