Bash function with array won't work

Question

I am trying to write a function in bash, but it doesn't work. The function, shown below, reads a file in the following format:

1 2 first 3
4 5 second 6
...

I'm trying to take the third word of every line and fill the array "arr" with those strings, without repeating identical strings. When I enabled the "echo" command at the top of the for loop, it printed only the first string in every iteration (in the above case "first").

Thank you!

function storeDevNames {

n=0
b=0
while read line; do
    line=$line
    tempArr=( $line )
    name=${tempArr[2]}
    for i in $arr ; do
        #echo ${arr[i]}
        if [ "${arr[i]}" == "$name" ]; then
            b=1
            break
        fi
    done
    if [ "$b" -eq 0 ]; then
        arr[n]=$name
        n=$(($n+1))
    fi
    b=0
done < $1
}

choroba: I call it using "storeDevNames a.txt". I am printing the array in a different function. I'll try to see your answer. Thanks!
– Gal Fleissig, Commented Apr 5, 2015 at 8:49
Are you sure that the input file is separated by regular spaces, and not non-breaking space characters? stackoverflow.com/questions/11272374/…
– asimovwasright, Commented Apr 5, 2015 at 9:00

choroba · Accepted Answer · 2015-04-05 09:11:24Z

1

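The following line seems suspicious, because $arr without a subscript expands only to the first element (${arr[0]}), so the loop never sees the rest of the array: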

    for i in $arr ; do

I changed it as follows and it works for me:

#! /bin/bash

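function storeDevNames {
    n=0                                 # next free index in arr
    b=0                                 # 1 if name is already stored
    while read -r line; do
        # line=$line # ?!
        tempArr=( $line )               # split the line into words
        name=${tempArr[2]}              # third word (index 2)
        for i in "${arr[@]}" ; do       # i is each stored string, not an index
            if [ "$i" == "$name" ]; then
                b=1
                break
            fi
        done
        if [ "$b" -eq 0 ]; then
            arr[n]=$name
            (( n++ ))
        fi
        b=0
    done    # reads standard input here; use: done < "$1" to read a file argument
}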

storeDevNames <<EOF
1 2 first 3
4 5 second 6
7 8 first 9
10 11 third 12
13 14 second 15
EOF

echo "${arr[@]}"
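
To see why the original loop printed only "first", here is a minimal sketch (the array contents are made up for illustration). It relies on two bash rules: $arr without a subscript means ${arr[0]}, and the subscript of an indexed array is evaluated arithmetically, where an unset variable name counts as 0.

```bash
#!/bin/bash
# Illustrative data; not taken from the asker's real file.
arr=(first second third)

# $arr is shorthand for ${arr[0]}, so this loop body runs only once.
plain=()
for i in $arr; do
    # i is the string "first"; used as a subscript it is evaluated
    # arithmetically, and the unset name "first" evaluates to 0,
    # so this reads ${arr[0]} again.
    plain+=( "${arr[i]}" )
done

# "${arr[@]}" expands to every element, one word per element.
all=()
for i in "${arr[@]}"; do
    all+=( "$i" )
done

echo "unsubscripted: ${plain[*]}"
echo "all elements:  ${all[*]}"
```

With the unsubscripted form the duplicate check only ever compares against the first stored name, which would also explain later duplicates slipping into the array.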

edited Apr 5, 2015 at 9:11

answered Apr 5, 2015 at 8:40

choroba


5 Comments

Gal Fleissig Over a year ago

You're right, it does print the whole array. I still don't get two things: 1. Why does it store two identical strings? Is there something wrong with my if-else? 2. Why doesn't it print every single element in the array with this echo command but prints only the first one every time?

choroba Over a year ago

@GalFl: I don't understand. I'm getting no duplicates. Show the code that produces them in the question.

Gal Fleissig Over a year ago

I tried it now with a simple txt file and it worked with no duplicates, but I tried it with a .comp file (the format I have to work with) and it does show duplicates. Maybe it's something with this format? Maybe there is something different with the spaces or end of lines?

Gal Fleissig Over a year ago

It works! So just to fully understand: the "i" in the for loop represents a number (like in C for example) or in this case a string?

choroba Over a year ago

It's the string. If you wanted numbers, you'd need something like for i in $(seq 0 $((${#arr[@]} - 1))) or for ((i=0; i<${#arr[@]}; i++))
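
As a small sketch of index-based iteration (sample array contents assumed, and list_by_index is a hypothetical helper name), bash can also expand the indices directly with "${!arr[@]}", which avoids computing the upper bound by hand:

```bash
#!/bin/bash
arr=(first second third)   # illustrative contents

# Hypothetical helper: prints one "index=value" line per element.
list_by_index() {
    local i
    # "${!arr[@]}" expands to the indices in use (0 1 2 here),
    # so i is a number and ${arr[i]} is the element at that index.
    for i in "${!arr[@]}"; do
        echo "$i=${arr[i]}"
    done
}

list_by_index
```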

David C. Rankin · Accepted Answer · 2015-04-05 08:47:12Z

1

You can replace all of your read block with:

arr=( $(awk '{print $3}' <"$1" | sort | uniq) )

This fills arr with the unique names found in the third word of each line (first, second, ...) and reduces the entire function to:

function storeDevNames {
    arr=( $(awk '{print $3}' <"$1" | sort | uniq) )
}

Note: this will provide a list of all unique device names in sorted order. Sorting to remove duplicates also destroys the original order. If you need to preserve the order except where duplicates are removed, see 4ae1e1's alternative.
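
One caveat with arr=( $(...) ): the unquoted command substitution undergoes word splitting and filename expansion, so a name containing * or ? could be expanded against the current directory. A minimal sketch of an alternative, assuming bash 4 or later (for mapfile); the function name storeDevNamesSorted and the sample data are hypothetical:

```bash
#!/bin/bash
# Hypothetical variant of storeDevNames; requires bash 4+ for mapfile.
function storeDevNamesSorted {
    # mapfile -t stores each output line as one array element,
    # with no glob expansion. sort -u is equivalent to sort | uniq.
    mapfile -t arr < <(awk '{print $3}' "$1" | sort -u)
}

# Usage with a throwaway sample file (made-up data).
sample=$(mktemp)
printf '%s\n' '1 2 first 3' '4 5 second 6' '7 8 first 9' > "$sample"
storeDevNamesSorted "$sample"
rm -f "$sample"
echo "${arr[@]}"
```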

edited Apr 5, 2015 at 8:47

answered Apr 5, 2015 at 8:38

David C. Rankin


2 Comments

4ae1e1 Over a year ago

Your answer breaks the order of lines, which might (or might not) be important. See the other awk answer (disclosure: mine) for how to preserve the order.

David C. Rankin Over a year ago

Indeed it does, if retaining the order of device names is important, then your answer is the one to use.

4ae1e1 · Accepted Answer · 2015-04-05 08:52:04Z

1

You're using the wrong tool. awk is designed for this kind of job.

awk '{ if (!seen[$3]++) print $3 }' <"$1"

This one-liner prints the third column of each line, removing duplicates along the way while preserving the order of lines (only the first occurrence of each unique string is printed). sort | uniq, on the other hand, breaks the original order of lines. The one-liner is also faster than sort | uniq on large files (which doesn't seem to apply in OP's case): it scans the file once in linear time, while sorting takes O(n log n) comparisons.

As an example, for an input file with contents

1 2 first 3
4 5 second 6
7 8 third 9
10 11 second 12
13 14 fourth 15

the above awk one-liner gives you

first
second
third
fourth

To put the results in an array:

arr=( $(awk '{ if (!seen[$3]++) print $3 }' <"$1") )

Then echo "${arr[@]}" will give you first second third fourth.
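
The seen[...] idiom also carries over to pure bash, which may help where awk is not allowed. This is a minimal sketch, assuming bash 4 or later for associative arrays; the function name storeDevNamesUnique is hypothetical, and unlike the awk one-liner it skips lines with fewer than three words:

```bash
#!/bin/bash
# Hypothetical pure-bash counterpart of the awk one-liner; needs bash 4+.
function storeDevNamesUnique {
    local -A seen=()        # names that were already stored
    local -a fields
    local name
    arr=()
    # read -a splits each line into words without glob expansion.
    while read -r -a fields; do
        name=${fields[2]}                   # third word (index 2)
        [[ -n $name ]] || continue          # skip lines with fewer than 3 words
        if [[ -z ${seen[$name]} ]]; then
            seen[$name]=1
            arr+=( "$name" )                # first occurrence: keep it, in file order
        fi
    done < "$1"
}

# Usage with a throwaway copy of the example input above.
sample=$(mktemp)
printf '%s\n' '1 2 first 3' '4 5 second 6' '7 8 third 9' \
              '10 11 second 12' '13 14 fourth 15' > "$sample"
storeDevNamesUnique "$sample"
rm -f "$sample"
echo "${arr[@]}"
```

The associative array makes each membership check a single lookup, so the inner for loop over arr is no longer needed.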

edited Apr 5, 2015 at 8:52

answered Apr 5, 2015 at 8:41

4ae1e1


2 Comments

Gal Fleissig Over a year ago

This looks like a really good solution, but since I'm a beginner in bash I'm trying to write the simplest code to understand rather than the most efficient to write. Plus we are probably not allowed to use "awk". Thank you very much!

4ae1e1 Over a year ago

@GalFl No problem, this might still help future users.
