Substitute value with result of calling function on value in unix shell

Question

I have a text stream that looks like this:

----------------------------------------
s123456789_9780
  heartbeat:test       @ 1344280205000000: '0'
  heartbeat:test       @ 1344272490000000: '0'

Those long numbers are timestamps in microseconds. I would like to run this output through some sort of pipe that will change those timestamps to a more human-understandable date.

I have a date command that can do that, given just the timestamp (with the following colon):

$ date --date=@$(echo 1344272490000000: | sed 's/.......$//') +%Y/%d/%m-%H:%M:%S
2012/06/08-10:01:30

I would like to end up with something like this:

----------------------------------------
s123456789_9780
  heartbeat:test       @ 2012/06/08-12:10:05: '0'
  heartbeat:test       @ 2012/06/08-10:01:30: '0'

I don't think sed will allow me to match the timestamp and replace it with the value of calling a shell function on it (although I'd love to be shown wrong). Perhaps awk can do it? I'm not very familiar with awk.

The other part that seems tricky to me is letting the lines that don't match through without modification.

I could of course write a Python program that would do this, but I'd rather keep this in shell if possible (this is generated inside a shell script, and I'd rather not have dependencies on outside files).

potong · Accepted Answer · 2012-08-09 23:27:14Z

3

This might work for you (GNU sed):

sed '/@ /!b;s//&\n/;h;s/.*\n//;s#\(.\{10\}\)[^:]*\(:.*\)#date --date=@\1 +%Y/%d/%m-%H:%M:%S"\2"#e;H;g;s/\n.*\n//' file

Explanation:

/@ /!b bail out and just print any lines that don't contain an @ followed by a space
s//&\n/ insert a newline after the above pattern
h copy the pattern space (PS) to the hold space (HS)
s/.*\n// delete upto and including the @ followed by a space
s#$.\{10\}$[^:]*$:.*$#date --date=@\1 +%Y/%d/%m-%H:%M:%S"\2"#e from whats remaining in the PS, make a back reference of the first 10 characters and from the : to the end of the string. Have these passed in to the date command and evaluate the result into the PS
H append the PS to the HS inserting a newline at the same time
g copy the HS into the PS
s/\n.*\n// remove the original section of the string

edited Aug 9, 2012 at 23:27

answered Aug 9, 2012 at 20:26

potong

59.3k6 gold badges55 silver badges92 bronze badges

Sign up to request clarification or add additional context in comments.

4 Comments

michelpm Over a year ago

The e flag works with files, but not with functions. I am using GNU sed version 4.2.1.

potong Over a year ago

@michelpm this probably only works with GNU sed within a linux/unix bash environment.

michelpm Over a year ago

@potong as I said, I am using GNU sed version 4.2.1 and I tested on Ubuntu 13.04 with both bash and zsh. It DOES work with executable files, but not with functions. square () { echo $(($1 * $1)) }, seq () { for (( i = 0 ; i < $1 ; i++ )); do echo $i; done }, seq 10 | sed 's/[0-9]*/square &/ge' complains that function square doesn't exist. It does work if I turn the square function into square.sh and change sed to call the file instead. Any ideas?

michelpm Over a year ago

@potong a inlined version works though: seq 10 | sed 's/[0-9]*/echo \$((& * &))/ge'

chepner · Accepted Answer · 2012-08-09 18:41:53Z

1

Bash with a little sed, preserving the whitespace of the input:

while read -r; do                                                                                                                                                                                                                                          
    parts=($REPLY)
    if [[ ${parts[0]} == "heartbeat:test" ]]; then
        dateStr=$(date --date=@${parts[2]%000000:} +%Y/%d/%m-%H:%M:%S)
        REPLY=$(echo "$REPLY" | sed "s#[0-9]\+000000:#$dateStr#")
    fi
    printf "%s\n" "$REPLY"
done

edited Aug 9, 2012 at 18:41

answered Aug 9, 2012 at 17:18

chepner

538k77 gold badges594 silver badges746 bronze badges

2 Comments

Sam Mussmann Over a year ago

Whitespace preservation is awesome! I changed the sed expression to use # instead of / so I don't have to do all that escaping, but otherwise this works wonderfully! Just for my edification, the parts=($REPLY) line is just splitting $REPLY into an array using the IFS, right?

chepner Over a year ago

That's right. Good idea changing the delimiter for sed; I wish I'd remembered that was an option. I'll update the answer to use #.

Stephane Rouberol · Accepted Answer · 2012-08-09 17:16:17Z

1

How about:

while read s1 at tm s2
do 
    tm=${tm%000000:}
    echo $s1 $at $(date --date @$tm +%Y/%d/%m-%H:%M:%S)
done < yourfile

answered Aug 9, 2012 at 17:16

Stephane Rouberol

4,40421 silver badges18 bronze badges

1 Comment

Sam Mussmann Over a year ago

This barfs on the lines without the timestamps -- I like the simplicity, though.

Thor · Accepted Answer · 2012-08-10 09:55:20Z

1

I would also like to see a sed solution, but it is a bit beyond my sed-fu. As awk supports strftime it is fairly straight forward here:

awk '
/^ *heartbeat/ { 
  gsub(".{7}$", "", $3)
  $3 = strftime("%Y/%d/%m-%T", $3)
  print " ", $1, $3
}

$0 !~ /heartbeat/' file

Output:

s123456789_9780
heartbeat:test 2012/06/08-21:10:05
heartbeat:test 2012/06/08-19:01:30

$3 is the microsecond field. gsub converts the timestamp to seconds.

The $0 !~ makes sure non-heartbeat lines are printed ({ print } implicitly is the default block).

edited Aug 10, 2012 at 9:55

answered Aug 9, 2012 at 17:05

Thor

47.7k12 gold badges125 silver badges140 bronze badges

7 Comments

Sam Mussmann Over a year ago

I don't think gsub is working -- I run this, and I get dates of 42600513/23/11-15:33:20, which is a little bit off. :-)

Thor Over a year ago

That's odd, I've added what I get to the answer. Which version of awk are you using? I've tested this with gawk.

Sam Mussmann Over a year ago

I'm using GNU awk 3.1.6. If I remove the strftime line, then I'm getting the long timestamps.

Thor Over a year ago

Maybe try with the original format string you were using? %Y/%d/%m-%H:%M:%S.

Sam Mussmann Over a year ago

That doesn't seem to work -- if I change the regexp from ".{7}$" to "000000:" it does work, though.

|

jxh · Accepted Answer · 2012-08-09 17:12:11Z

0

This does it mostly within bash using your date command:

#!/bin/bash
IFS=$
while read a ; do
case "$a" in
*" @ "[0-9]*) pre=${a% @ *}
              a=${a#$pre @ }
              post=${a##*:}
              a=${a%??????:$post}
              echo "$pre$(date --date=@$a +%Y/%d/%m-%H:%M:%S):$post"
              ;;
*)            echo "$a" ;;
esac
done <<.
----------------------------------------
s123456789_9780
  heartbeat:test       @ 1344280205000000: '0'
  heartbeat:test       @ 1344272490000000: '0'
.

answered Aug 9, 2012 at 17:12

jxh

70.9k8 gold badges116 silver badges204 bronze badges

Collectives™ on Stack Overflow

Substitute value with result of calling function on value in unix shell

5 Answers 5

4 Comments

2 Comments

1 Comment

7 Comments

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

5 Answers 5

4 Comments

2 Comments

1 Comment

7 Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related