Shell Scripting -- need some awk/sed help...

bcredeur97 · May 15, 2017

I'm converting some old C code to a ksh shell script and need to grab the numbers at the end of this line, from the last * to the tilde. Regular expressions have never really been something I'm good at lol, so I'm hoping someone here knows how.

G50*N*20160207*18885~

              ^^^^^^^

The first part is always static up to the end of the date (e.g. 20170505). After the asterisk that follows the date, I need to grab the numbers, which can be anywhere from 1 to 25 characters long, and then the line ends in a tilde.

unijab · May 15, 2017

awk -F \* '{print $4}'
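As written, this prints the fourth field with the trailing tilde still attached (18885~). A minimal sketch that treats both * and ~ as field separators, assuming the line always ends in a single tilde (the variable names line and num are just for illustration):

```shell
# Split on either '*' or '~' so the 4th field holds only the digits.
line='G50*N*20160207*18885~'
num=$(printf '%s\n' "$line" | awk -F '[*~]' '{ print $4 }')
echo "$num"
```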

macedot · May 15, 2017

Well, it really depends on what the data pattern looks like. If you can guarantee the number will always be at the end...


user@host:~$ echo "G50*N*20170505*18885~"| rev | cut -d'*' -f1 | rev | sed -e 's/\~//g'
18885
user@host:~$

Which:

- prints the given data

- reverses it

- cuts out only the first column, using * as the delimiter

- reverses it back

- removes the tilde
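Since the target is a ksh script, the same "take whatever follows the last asterisk" idea can probably also be done with parameter expansion alone, with no external commands. A rough sketch, assuming the line is already held in a variable (the names line and num are made up here):

```shell
line='G50*N*20160207*18885~'
num=${line##*\*}   # drop everything up to and including the last '*'
num=${num%\~}      # drop the trailing '~'
echo "$num"
```

This avoids spawning rev, cut and sed for every line, which may matter if the file is large.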

You can also try awk:


user@host:~$ echo "G50*N*20170505*18885~"| awk -F '\*' '{print $4}' | sed -e 's/\~//g'
18885
user@host:~$

This is about the same, except for the awk -F '\*' '{print $4}' part, which prints the 4th column of the data (as in unijab's post).

bcredeur97 · May 15, 2017

@tmacedo yes the pattern will always be constant up to the point where you reach the numbers I want after that 3rd asterisk.

These lines are in what is essentially a text file; I just need it to do this only on lines that begin with 'G50' or 'BEG'.

but I see what you did there with the reversing of the statements and cutting the part that I need off.. Thank you! makes more sense now

Edit: simply grepping for lines that begin with G50 or BEG should work.
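The filtering and the extraction could also be folded into a single awk call instead of a separate grep. A sketch under some assumptions: the sample file is hypothetical, only the G50 line comes from this thread, and the BEG line is assumed to have the same field layout.

```shell
# Hypothetical sample data: only the G50 line comes from this thread.
sample=$(mktemp)
printf '%s\n' 'G50*N*20160207*18885~' 'BEG*N*20160207*42~' 'N1*N*20160207*999~' > "$sample"

# Act only on lines starting with G50* or BEG*, then print the 4th field.
result=$(awk -F '[*~]' '/^(G50|BEG)\*/ { print $4 }' "$sample")
echo "$result"
rm -f "$sample"
```

The N1 line is skipped because it does not match the anchored pattern.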

Azgoth 2 · May 15, 2017

After some quick testing with awk: awk 'match($0, /\*([0-9]+)~/, res) { print res[1] }' file1 file2 ...

Where:

match($0, /\*([0-9]+)~/, res) matches the regular expression \*([0-9]+)~ (regular expressions are enclosed by // in awk, and the asterisk is escaped so it matches a literal *) against $0 (i.e., the current input line) and saves the result to res. The three-argument form of match is a gawk extension, so this needs gawk rather than a plain POSIX awk.
{ print res[1] } prints item 1 of the result array. Since there were capturing parentheses in the regular expression, this is what was inside the parentheses. (Item 0 in the array for this bit of code is the full match, as if there weren't capturing parentheses there.)
file1 file2 ... are the files you need to find this pattern in.

For the line G50*N*20160207*18885~ , this should return just 18885.
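On a system without gawk, roughly the same capture can be sketched with plain sed. This assumes the digits are always the last thing before the closing tilde, and borrows the 1 to 25 length limit from the first post:

```shell
line='G50*N*20160207*18885~'
# -n plus the p flag prints only lines where the substitution succeeded.
num=$(printf '%s\n' "$line" | sed -n 's/.*\*\([0-9]\{1,25\}\)~$/\1/p')
echo "$num"
```

Lines that do not end in an asterisk, digits and a tilde produce no output at all, so they are easy to tell apart from real matches.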
