How to copy or move millions of files concurrently in linux
Sometimes, you need to copy or move files from one drive to another. Copying or moving files using cp
/ mv
one by one can take considerable time some times days and weeks depending upon number of files, size of individual file and disk speed.
Here’s one way to speed up this process by using which you can copy / move millions of files concurrently from one directory to another in linux utilizing all CPU threads.
find . -type f | parallel --bar cp {} /path/to/destination
Break down of this command:
find . -type f
means find all files in current directory. We then pipe this list to a tool called parallel
which runs cp
on all files in multithreaded fashion. {}
is a placeholder for the current file to be copied to the destination folder, which is /path/to/destination
in this case. And --bar
is used to display a progress bar.
You might have to install parallel
package (using apt
, pacman
, dnf
, yum
or whatever package manager your distro has) if its not already installed. Its available in all main line linux distros.
I learned this trick during my job as a backend developer. It can save you some time if you are trying to copy millions of large files from one drive to another. Sometimes, it can bring the copy / move time to hours from days and to days from weeks!
Enjoy Reading This Article?
Here are some more articles you might like to read next: