Oracle SQL - Regular Expression matching using REGEXP_REPLACE()

Question

Good morning,

I am hoping to find assistance with writing a select query to remove some text from a column.

In a previous query I created a column called "TEXT_MINING", on which code written by another developer will perform some text mining analysis. The TEXT_MINING column has text that looks like:

EMPLOYEE FOUND BROKEN HANDLE ON HAMMER * 02-08-18 15:19:22 PM * I found a hammer that had the wood split on the handle, tossed into scrap.

I want to remove both * characters and all of the text between them to help my software engineer do some text mining. Here is my current dilemma:

Not only do I not know how to use REGEXP_REPLACE, but I can't get the REGEXP worked out. I currently have:

^[*]\w[*]$

So it looks like:

REGEXP_REPLACE(col, '^[*]\w[*]$', '')

Could anyone advise?

Thank you!

I fail to see why "text mining" cannot handle the original column, but that is an entirely different matter.
– Gordon Linoff, Commented Feb 8, 2018 at 13:17
@GordonLinoff You are exactly correct there... but I am just trying to do as I am told.
– artemis, Commented Feb 8, 2018 at 13:17
Never, at least not that I have EVER seen @GordonLinoff. However, I think the solutions below worked!
– artemis, Commented Feb 8, 2018 at 13:21

Wiktor Stribiżew · Accepted Answer · 2018-02-08 13:16:55Z

3

You may use this approach to remove one or more occurrences of *...* substrings in your column:

SELECT REGEXP_REPLACE(
   'EMPLOYEE FOUND BROKEN HANDLE ON HAMMER * 02-08-18 15:19:22 PM * I found a hammer that had the wood split on the handle, tossed into scrap.', 
   '\s*\*[^*]*\*', 
   ''
) as Result from dual


Pattern details

\s* - zero or more whitespace characters
\* - a literal * char
[^*]* - zero or more chars other than *
\* - a literal * char.
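
As a small sketch of what the leading \s* contributes, compare the pattern with and without it. The shortened sample string below is made up for illustration and is not taken from the real column:

```sql
-- Shortened, made-up sample; the real TEXT_MINING values are longer.
SELECT
  -- Without \s*: the space before the first * survives, leaving two spaces.
  REGEXP_REPLACE('BROKEN HAMMER * 02-08-18 * I found a hammer',
                 '\*[^*]*\*', '')    AS without_ws,   -- BROKEN HAMMER  I found a hammer
  -- With \s*: that space is consumed together with the *...* block.
  REGEXP_REPLACE('BROKEN HAMMER * 02-08-18 * I found a hammer',
                 '\s*\*[^*]*\*', '') AS with_ws       -- BROKEN HAMMER I found a hammer
FROM dual
```

The column aliases without_ws and with_ws are only labels chosen for this example.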


answered Feb 8, 2018 at 13:16


4 Comments

artemis Over a year ago

Thank you very much for this and for your explanation! This is great!

Gordon Linoff Over a year ago

This is a safer solution because it stops at the next * rather than the last one.

artemis Over a year ago

Wiktor, this is fantastic and I am going to move forward from this. I also appreciate you taking the time to break out the "pattern details" for me. I learn a lot on this and am appreciative of your time.

Wiktor Stribiżew Over a year ago

@bm0r3son Note that * is a special char (an operator called a quantifier, denoting 0 or more occurrences of the pattern it modifies), so you need to escape it to match a literal * char (like "\*"). Inside a bracket expression (character set), however, you do not have to escape it ("[*]").
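
A quick sketch of the two equivalent ways to match a literal *, using made-up input strings. It also suggests why the pattern from the question leaves the text untouched: with the ^ and $ anchors and a single \w, it only matches a value that consists of exactly a *, one word character, and a *.

```sql
SELECT
  REGEXP_REPLACE('a*b*c', '\*', '')  AS escaped,      -- abc
  REGEXP_REPLACE('a*b*c', '[*]', '') AS bracketed,    -- abc
  -- Pattern from the question: no match here, so the input comes back unchanged.
  REGEXP_REPLACE('a * b * c', '^[*]\w[*]$', '') AS anchored  -- a * b * c
FROM dual
```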

Aleksej · Accepted Answer · 2018-02-08 13:21:22Z

2

This could be a way:

select regexp_replace(yourString, '\*.*\*', '') from yourTable

Please note that this will remove everything between the first and the last '*' in the string, including the two '*' characters themselves; for example:

with test(x) as (
select 'Something * something else * and a * just before another * and something more' from dual
)
select regexp_replace(x, '\*.*\*', '') from test

gives:

Something  and something more
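
For comparison, here is a sketch that runs the same test string through the greedy pattern and through a negated bracket expression, [^*]*, which cannot run past the next *. The column aliases are only illustrative.

```sql
with test(x) as (
select 'Something * something else * and a * just before another * and something more' from dual
)
select regexp_replace(x, '\*.*\*', '')    as greedy,      -- Something  and something more
       regexp_replace(x, '\*[^*]*\*', '') as per_block    -- Something  and a  and something more
from test
```

Which result is the right one depends on whether a row can contain more than one *...* block that should be kept apart.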

edited Feb 8, 2018 at 13:21

answered Feb 8, 2018 at 13:15


4 Comments

artemis Over a year ago

I employed this method and it worked perfectly. It is simple and effective. Would it be better to use Wiktor's regular expression above? This method worked amazingly for me. Thank you!

Wiktor Stribiżew Over a year ago

@bm0r3son This \*.*\* will make "Hi " out of "Hi *Tom*, have you met *Jim?*" because .* matches up to the last * occurrence.

Aleksej Over a year ago

@bm0r3son: the two expressions do different things; which one to use depends only on what best suits your needs.

artemis Over a year ago

Thank you so much!
