How can I store the "find" command results as an array in Bash

Question

I am trying to save the result from find as arrays. Here is my code:

#!/bin/bash

echo "input : "
read input

echo "searching file with this pattern '${input}' under present directory"
array=`find . -name ${input}`

len=${#array[*]}
echo "found : ${len}"

i=0

while [ $i -lt $len ]
do
echo ${array[$i]}
let i++
done

There are 2 .txt files under the current directory, so I expect ${len} to be 2. However, it prints 1, because the whole output of find is stored as a single element. How can I fix this?

P.S.
I found several solutions on Stack Overflow for similar problems. However, they are a little different, so I can't apply them to my case. I need to store the results in a variable before the loop. Thanks again.

John1024 · Accepted Answer · 2020-05-04 18:54:37Z

216

Update 2020 for Linux Users:

If you have an up-to-date version of bash (4.4-alpha or better), as you probably do if you are on Linux, then you should be using Benjamin W.'s answer.

If you are on Mac OS, which (last I checked) still used bash 3.2, or are otherwise using an older bash, then continue on to the next section.

Answer for bash 4.3 or earlier

Here is one solution for getting the output of find into a bash array:

array=()
while IFS=  read -r -d $'\0'; do
    array+=("$REPLY")
done < <(find . -name "${input}" -print0)

This is tricky because, in general, file names can contain spaces, newlines, and other script-hostile characters. The only way to use find and have the file names safely separated from each other is to use -print0, which terminates each file name with a null character. This would not be much of an inconvenience if bash's readarray/mapfile builtins supported null-separated input, but in bash 4.3 and earlier they don't. Bash's read does, and that leads us to the loop above.

[This answer was originally written in 2014. If you have a recent version of bash, please see the update below.]

How it works

The first line creates an empty array: array=()
Every time that the read statement is executed, a null-separated file name is read from standard input. The -r option tells read to leave backslash characters alone. The -d $'\0' tells read that the input will be null-separated. Since we omit the name to read, the shell puts the input into the default name: REPLY.
The array+=("$REPLY") statement appends the new file name to the array array.
The final line combines redirection and process substitution to provide the output of find to the standard input of the while loop.
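
As a rough illustration of the steps above, the sketch below runs the same loop against a scratch directory. The directory and the awkward file names are made up for the demonstration, and a find that supports -print0 (for example GNU find) is assumed.

```shell
#!/usr/bin/env bash
# Sketch: collect find results into an array with a NUL-delimited read loop.
# The scratch directory and file names are invented for illustration.
tmpdir=$(mktemp -d)
trap 'rm -rf "$tmpdir"' EXIT
touch "$tmpdir/plain.txt" "$tmpdir/with space.txt" "$tmpdir/  padded .txt"
touch "$tmpdir"/$'two\nlines.txt'      # a name containing a line break

input='*.txt'
array=()                                # start with an empty array
while IFS= read -r -d ''; do            # read one NUL-terminated name into REPLY
    array+=("$REPLY")                   # append it unchanged
done < <(find "$tmpdir" -type f -name "$input" -print0)

echo "found: ${#array[@]}"
printf '<%s>\n' "${array[@]}"
```

Each of the four files ends up as exactly one element, including the name with the embedded newline and the one with leading and trailing spaces.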

Why use process substitution?

If we didn't use process substitution, the loop could be written as:

array=()
find . -name "${input}" -print0 >tmpfile
while IFS=  read -r -d $'\0'; do
    array+=("$REPLY")
done <tmpfile
rm -f tmpfile

In the above, the output of find is stored in a temporary file and that file is used as standard input to the while loop. The idea of process substitution is to make such temporary files unnecessary. So, instead of having the while loop get its stdin from tmpfile, we can have it get its stdin from <(find . -name "${input}" -print0).

Process substitution is widely useful. In many places where a command wants to read from a file, you can specify process substitution, <(...), instead of a file name. There is an analogous form, >(...), that can be used in place of a file name where the command wants to write to the file.

Like arrays, process substitution is a feature of bash and other advanced shells. It is not part of the POSIX standard.
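
One practical reason to feed the loop through <(...) rather than an ordinary pipe is that, by default, bash runs each part of a pipeline in a subshell, so an array filled there is gone once the pipeline ends. The following is a minimal sketch of that difference; printf stands in for find, and the three names are invented.

```shell
#!/usr/bin/env bash
# Sketch: pipe versus process substitution as the loop's input.
# printf stands in for find here; the file names are made up.

piped=()
printf '%s\0' a.txt b.txt c.txt | while IFS= read -r -d ''; do
    piped+=("$REPLY")          # runs in a subshell; the change is lost afterwards
done

substituted=()
while IFS= read -r -d ''; do
    substituted+=("$REPLY")    # runs in the current shell
done < <(printf '%s\0' a.txt b.txt c.txt)

echo "pipe: ${#piped[@]} element(s)"
echo "process substitution: ${#substituted[@]} element(s)"
```

Run as a script with default options, the piped array stays empty while the other one holds all three names.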

Alternative: lastpipe

If desired, lastpipe can be used instead of process substitution (hat tip: Caesar):

set +m
shopt -s lastpipe
array=()
find . -name "${input}" -print0 | while IFS=  read -r -d $'\0'; do array+=("$REPLY"); done; declare -p array

shopt -s lastpipe tells bash to run the last command in the pipeline in the current shell (rather than in a subshell). This way, the array remains in existence after the pipeline completes. Because lastpipe only takes effect if job control is turned off, we run set +m. (In a script, as opposed to the command line, job control is off by default.)
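
Written out as a script, the lastpipe variant might look like the sketch below. It assumes bash 4.2 or later (where lastpipe is available); printf again stands in for find, and the names are invented.

```shell
#!/usr/bin/env bash
# Sketch: lastpipe lets the final pipeline stage run in the current shell.
# 'set +m' is harmless in a script, where job control is already off.
set +m
shopt -s lastpipe

array=()
printf '%s\0' one.txt 'two words.txt' | while IFS= read -r -d ''; do
    array+=("$REPLY")          # survives, because this loop is not in a subshell
done

declare -p array
```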

Additional notes

The following command creates a shell variable, not a shell array:

array=`find . -name "${input}"`

If you wanted to create an array, you would need to put parens around the output of find. So, naively, one could:

array=(`find . -name "${input}"`)  # don't do this

The problem is that the shell performs word splitting on the results of find so that the elements of the array are not guaranteed to be what you want.
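
To make the difference concrete, here is a small sketch that compares the scalar assignment with the naive array assignment for a single file whose name contains spaces. The scratch directory and file name are invented, and the temporary directory's own path is assumed to contain no whitespace.

```shell
#!/usr/bin/env bash
# Sketch: what word splitting does to a file name containing spaces.
tmpdir=$(mktemp -d)
trap 'rm -rf "$tmpdir"' EXIT
touch "$tmpdir/my summer notes.txt"

scalar=$(find "$tmpdir" -type f -name '*.txt')      # one string, not an array
split=( $(find "$tmpdir" -type f -name '*.txt') )   # split at every space: wrong

echo "scalar: ${#scalar[@]} element"
echo "split:  ${#split[@]} elements for a single file"
printf '<%s>\n' "${split[@]}"
```

The scalar always reports a length of 1 no matter how many files find prints, which is the behaviour described in the question, while the naive array breaks the one file name into several pieces.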

Update 2019

Starting with version 4.4-alpha, bash's mapfile builtin (also available as readarray) supports a -d option, so the above loop is no longer necessary. Instead, one can use:

mapfile -d $'\0' array < <(find . -name "${input}" -print0)

For more information on this, please see (and upvote) Benjamin W.'s answer.
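
If a script has to run on both old and new versions of bash, one possible approach is to check BASH_VERSINFO and fall back to the read loop. This is only a sketch under that assumption; the scratch directory and file names are invented.

```shell
#!/usr/bin/env bash
# Sketch: use mapfile -d '' on bash 4.4+, otherwise the NUL-delimited read loop.
tmpdir=$(mktemp -d)
trap 'rm -rf "$tmpdir"' EXIT
touch "$tmpdir/a.txt" "$tmpdir/b c.txt"
input='*.txt'

array=()
if (( BASH_VERSINFO[0] > 4 || (BASH_VERSINFO[0] == 4 && BASH_VERSINFO[1] >= 4) )); then
    # Newer bash: mapfile can split on NUL directly.
    mapfile -d '' array < <(find "$tmpdir" -type f -name "$input" -print0)
else
    # Older bash (for example 3.2): read one NUL-terminated name at a time.
    while IFS= read -r -d ''; do
        array+=("$REPLY")
    done < <(find "$tmpdir" -type f -name "$input" -print0)
fi

echo "found: ${#array[@]}"
```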

edited May 4, 2020 at 18:54

answered Apr 29, 2014 at 6:38

John1024


31 Comments

John1024 Over a year ago

@JuneyoungOh Glad it helped. I added a section of process substitution.

John1024 Over a year ago

@Rockallite That is a good observation but incomplete. While it is true that we don't split into multiple words, we still need IFS= to avoid removal of whitespace from the beginnings or ends of the input lines. You can test this easily by comparing the output of read var <<<' abc '; echo ">$var<" with the output of IFS= read var <<<' abc '; echo ">$var<". In the former case, the spaces before and after abc are removed. In the latter, they aren't. File names that begin or end with whitespace may be unusual but, if they exist, we want them processed correctly.

Przemysław Sienkiewicz Over a year ago

Hi, after I execute your code I get the message: syntax error near unexpected token `<' on the line `done < <(find aaa/ -not -newermt "$last_build_timestamp_v" -type f -print0)'

glenn jackman Over a year ago

A note: the simpler '' can be used instead of $'\0':

n=0; while IFS= read -r -d '' line || [ "$line" ]; do echo "$((++n)):$line"; done < <(printf 'first\nstill first\0second\0third')

John1024 Over a year ago

@theeagle I assume that you intended to write BLAH=$(find . -name '*.php'). As discussed in the answer, that approach will work in limited cases but it won't work in general with all filenames and it doesn't produce, as the OP expected, an array.


Community · Accepted Answer · 2020-06-20 09:12:55Z

141

Bash 4.4 introduced a -d option to readarray/mapfile, so this can now be solved with

readarray -d '' array < <(find . -name "$input" -print0)

for a method that works with arbitrary filenames including blanks, newlines, and globbing characters. This requires that your find supports -print0, as for example GNU find does.

From the manual (omitting other options):

mapfile [-d delim] [array]
-d
The first character of delim is used to terminate each input line, rather than newline. If delim is the empty string, mapfile will terminate a line when it reads a NUL character.

And readarray is just a synonym of mapfile.
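
The effect of the NUL delimiter can be seen in a small sketch that reads the same find results twice, once NUL-delimited and once newline-delimited. It assumes bash 4.4 or later and a find that supports -print0; the file name with the embedded line break is invented.

```shell
#!/usr/bin/env bash
# Sketch (needs bash 4.4+): NUL-delimited versus newline-delimited reading.
tmpdir=$(mktemp -d)
trap 'rm -rf "$tmpdir"' EXIT
touch "$tmpdir"/$'first line\nsecond line.txt'   # one file, name spans two lines

readarray -d '' by_nul  < <(find "$tmpdir" -type f -print0)
readarray -t    by_line < <(find "$tmpdir" -type f)

echo "NUL-delimited:     ${#by_nul[@]} element(s)"
echo "newline-delimited: ${#by_line[@]} element(s)"
```

Only the NUL-delimited version keeps the single file as a single element.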

edited Jun 20, 2020 at 9:12

CommunityBot


answered Feb 6, 2019 at 19:53

Benjamin W.


5 Comments

dpritch Over a year ago

This is great, I've already given it a +1. There's one caveat though -- if the command inside the process substitution fails, the exit code of the overall command is still 0. Is there a good way to have the exit code propagated to the outer command?

Benjamin W. Over a year ago

@dpritch Inspired by this answer, you could print the exit status as part of the process substitution: readarray -d '' array < <(find . -name "$input" -print0; printf "$?"), and then examine the last array element: echo "${array[-1]}".
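
A slightly fuller sketch of that idea follows. It pulls the status out of the array and then removes it, so that only file names remain. It assumes bash 4.4 or later, and the directory /nonexistent-dir-for-demo is a made-up path that is assumed not to exist, so find fails on purpose.

```shell
#!/usr/bin/env bash
# Sketch (needs bash 4.4+): carry find's exit status as the last array element.
# /nonexistent-dir-for-demo is a hypothetical path assumed not to exist.
readarray -d '' array < <(
    find /nonexistent-dir-for-demo -print0 2>/dev/null
    printf '%s' "$?"            # append find's exit status after the names
)

status=${array[-1]}             # last element is the exit status
unset 'array[-1]'               # drop it so only file names remain

echo "find exited with status ${status}; ${#array[@]} name(s) collected"
```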

dpritch Over a year ago

^^ This is brilliant!

markling Over a year ago

Why is -print0 necessary when it seems sufficient to enclose the find command in double quotes to handle filenames with special chars (even double quotes), like so: files=("$(find -type f)")?

Benjamin W. Over a year ago

@markling It's necessary because filenames are allowed to include linebreaks; the null byte is the only character that can't possibly appear in a filename. Two good references for this: Fixing Unix/Linux/POSIX Filenames: Control Characters (such as Newline), Leading Dashes, and Other Problems, Filenames and Pathnames in Shell: How to do it Correctly

sunknudsen · Accepted Answer · 2020-09-19 13:16:46Z

45

The following appears to work for both Bash and Z Shell on macOS.

#!/bin/bash

IFS=$'\n'
paths=($(find . -name "foo"))
unset IFS

printf "%s\n" "${paths[@]}"
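
A variation on this approach, sketched below, also switches off pathname expansion while the array is built, so that a name containing * is not expanded. It is still only safe when no file name contains a line break. The scratch directory and file names are invented.

```shell
#!/usr/bin/env bash
# Sketch: newline splitting with globbing disabled while the array is built.
# Not safe for file names that contain a line break.
tmpdir=$(mktemp -d)
trap 'rm -rf "$tmpdir"' EXIT
touch "$tmpdir/foo bar.txt" "$tmpdir/star * name.txt"

old_ifs=$IFS
IFS=$'\n'                       # split find's output on newlines only
set -f                          # no pathname expansion of the results
paths=( $(find "$tmpdir" -type f -name '*.txt') )
set +f
IFS=$old_ifs

printf '%s\n' "${paths[@]}"
```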

edited Sep 19, 2020 at 13:16

answered Sep 19, 2020 at 12:52

sunknudsen


6 Comments

Stéphane Gourichon Over a year ago

This works with files having spaces and other special characters, fails with the (admittedly rare) case of files having a linebreak in their name. You can create one for a test with printf "%b" "file name with spaces, a star * ...\012and a second line\0" | xargs -0 touch

Matt Korostoff Over a year ago

maybe I'm missing something here, but this seems like the much clearer, easier solution for 99% of cases

pathfinder Over a year ago

Definitely works great for zsh on macOS Big Sur :) thanks! - but I also know my fileset has no names with newlines, because who does that? I have never seen one in the wild and I made the files so I know it's not an issue.

OLEGSHA Over a year ago

Newlines are an issue in case the script may operate on files that are supplied by a potentially malicious user. For a hypothetical example, if your system ran something like detect-malware "${paths[@]}", a virus could be smuggled past this defense by including a newline in its name.

pjh Over a year ago

See Bash Pitfalls #1 (for f in $(ls *.mp3)).


chepner · Accepted Answer · 2014-04-29 18:07:19Z

24

If you are using bash 4 or later, you can replace your use of find with

shopt -s globstar nullglob
array=( **/*"$input"* )

The ** pattern enabled by globstar matches 0 or more directories, allowing the pattern to match to an arbitrary depth in the current directory. Without the nullglob option, a pattern that matches nothing is left as a literal string (after parameter expansion), so with no matches you would have an array with a single string rather than an empty array.

Add the dotglob option to the first line as well if you want to traverse hidden directories (like .ssh) and match hidden files (like .bashrc) as well.
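
The three options can be tried out in a scratch directory, as in the sketch below. It assumes bash 4 or later; the directory layout and file names are invented.

```shell
#!/usr/bin/env bash
# Sketch (bash 4+): recursive globbing instead of find.
tmpdir=$(mktemp -d)
trap 'rm -rf "$tmpdir"' EXIT
mkdir -p "$tmpdir/sub/deeper" "$tmpdir/.hidden"
touch "$tmpdir/a.txt" "$tmpdir/sub/deeper/b c.txt" "$tmpdir/.hidden/d.txt"

shopt -s globstar nullglob
cd "$tmpdir" || exit 1

input='.txt'
visible=( **/*"$input"* )       # hidden directories are skipped
none=( **/*no-such-name* )      # nullglob: no match gives an empty array

shopt -s dotglob
with_hidden=( **/*"$input"* )   # now .hidden/d.txt is included as well

echo "visible: ${#visible[@]}, none: ${#none[@]}, with hidden: ${#with_hidden[@]}"
```

Because the array is filled by pathname expansion rather than by splitting command output, the name with a space stays in one piece.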

edited Apr 29, 2014 at 18:07

answered Apr 29, 2014 at 17:58

chepner


5 Comments

kojiro Over a year ago

Maybe nullglob too…

chepner Over a year ago

Yeah, I always forget that.

gniourf_gniourf Over a year ago

Note that this will not include the hidden files and directories, unless dotglob is set (this may or may not be wanted, but it's worth mentioning too).

Guss Over a year ago

This looks very useful, unless you actually need find's more interesting file matching features which aren't name glob based (for example, find by type, date, etc).

chepner Over a year ago

Indeed. find still has its uses (unless you are using zsh, in which case I think just about anything find can do you can do with some unreadable set of glob qualifiers :) )

Ahmed Al-Haffar · Accepted Answer · 2015-08-06 06:02:32Z

14

You can try something like

array=(`find . -type f | sort -r | head -2`)

To print the array values, you can use echo "${array[*]}".

answered Aug 6, 2015 at 6:02

Ahmed Al-Haffar


2 Comments

gniourf_gniourf Over a year ago

Breaks if there are filenames with spaces or glob characters.

Ed Morton Dec 14, 2024 at 11:27

You should copy/paste that into shellcheck.net and fix the issues it'll tell you about.

zappee · Accepted Answer · 2024-07-28 15:38:20Z

5

The suggested solutions are nice but I think that nobody mentioned the most simple and trivial solution.

This:

jars_home=/home/xxx/jars
files=($(find "$jars_home" -type f -name "*.jar"))

It uses the standard bash array declaration syntax:

myArray=("red" "yellow" "blue" "green")

Then:

array length: echo ${#files[@]} # 2
1st element: echo ${files[0]} # /home/xxx/jars/hello.jar
2nd element: echo ${files[1]} # /home/xxx/jars/something-else.jar
print the array elements: printf '%s\n' "${files[@]}"

edited Jul 28, 2024 at 15:38

answered Jul 28, 2024 at 15:17

zappee


6 Comments

Fernando Crespo Over a year ago

This is the way!

Ed Morton Over a year ago

That'll fail given various characters in the file names it finds, the contents of the directory you run it from, environment settings, etc. as mentioned in comments to other similar answers, e.g. stackoverflow.com/a/31847896/1745001. It's not doing myArray=("red *" "yellow" "blue" "green") as you think (I added the * to show the globbing issue), it's doing myArray=(red * yellow blue green) which has very different results.
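
The globbing issue can be reproduced with a short sketch. The scratch directory and the file names (taken from the colour example) are invented for the demonstration.

```shell
#!/usr/bin/env bash
# Sketch: an unquoted $(find ...) is word-split and then glob-expanded.
tmpdir=$(mktemp -d)
trap 'rm -rf "$tmpdir"' EXIT
cd "$tmpdir" || exit 1
touch 'red *' yellow blue

# find prints a single name, "./red *", but the array does not get one element:
# the name is split at the space, and the bare * then matches every file here.
files=( $(find . -type f -name 'red*') )

printf '<%s>\n' "${files[@]}"
echo "elements: ${#files[@]}"
```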

zappee Dec 11, 2024 at 1:31

@EdMorton I think that you misunderstood something here. If you would like to use wildcard characters in your find command, then you must add them after the -name param. The array with color names is just an example result, nothing else.

Ed Morton Dec 11, 2024 at 12:38

You need to use readarray -d '' files < <(find "$jars_home" -type f -name "*.jar" -printf '%p\0') or similar to robustly populate files[] with the output of find.

zappee Dec 11, 2024 at 15:29

Thanks for the correction/help/explanation. I will update the code in my post accordingly.


Funmungus · Accepted Answer · 2021-10-27 15:30:29Z

2

None of these solutions suited me because I didn't feel like learning readarray and mapfile. Here is what I came up with.

#!/bin/bash

echo "input : "
read input

echo "searching file with this pattern '${input}' under present directory"
# The only change is here. Append to array for each non-empty line.
array=()
while IFS= read -r line; do
    [[ -n "$line" ]] && array+=("$line")
done <<< "$(find . -name "${input}" -print)"  # no ';' after done, or the here-string detaches from the loop

len=${#array[@]}
echo "found : ${len}"

i=0

while [ $i -lt $len ]
do
echo ${array[$i]}
let i++
done

answered Oct 27, 2021 at 15:30

Funmungus


2 Comments

Pujianto Over a year ago

I like this one. But shellcheck asked me to remove the semicolon in this line done; <<<

Ed Morton Dec 14, 2024 at 11:25

You should copy/paste that into shellcheck.net and fix the issues it'll tell you about.

suhas jadhav · Accepted Answer · 2024-05-31 07:53:27Z

-1

Long way but it works for me...

findCms=$(find -name <name>)
IFS=' ' read -ra arr <<< $findCms
for i in "${arr[@]}"; do
  echo "$i"
done

another way...

findCmd=$(find -name text-to-find -print)
while IFS= read -r line; do
    echo "Line: $line"
done <<< "$findCmd"

edited May 31, 2024 at 7:53

answered May 29, 2024 at 14:40

suhas jadhav


1 Comment

Ed Morton Dec 14, 2024 at 11:25

You should copy/paste that into shellcheck.net and fix the issues it'll tell you about.

user1357768 · Accepted Answer · 2014-04-29 07:07:32Z

-2

You could do it like this:

#!/bin/bash
echo "input : "
read input

echo "searching file with this pattern '${input}' under present directory"
array=(`find . -name '*'${input}'*'`)

for i in "${array[@]}"
do :
    echo $i
done

answered Apr 29, 2014 at 7:07

user1357768


1 Comment

Juneyoung Oh Over a year ago

Thanks a lot. But as @anishsane pointed out, spaces in filenames have to be handled in my program. Anyway, thanks!

Benjamin W. · Accepted Answer · 2019-02-08 19:37:18Z

-3

In bash, $(<any_shell_cmd>) helps to run a command and capture the output. Passing this to IFS with \n as delimiter helps to convert that to an array.

IFS='\n' read -r -a txt_files <<< $(find /path/to/dir -name "*.txt")

edited Feb 8, 2019 at 19:37

Benjamin W.


answered Jan 26, 2018 at 9:43

rashok


2 Comments

Benjamin W. Over a year ago

This will get only the first file of the results of find into the array.

Ed Morton Dec 14, 2024 at 11:26

You should copy/paste that into shellcheck.net and fix the issues it'll tell you about.
