PHP: how to remove non-language characters from a string?

Question

How can I remove all non-language characters from a string?

I want to remove characters like the ones below, and all other non-language characters:



I am using this:

preg_replace("/[^a-z0-9A-Z\-\'\|\!\.\?\:\)\(\;\*\"]/u", " ", $text );

This works for English, but I need to allow the characters of all languages, such as Russian, Arabic, Hebrew, Japanese...

Are there any string functions I can use to keep all language characters?

Thanks

What you have there are code points in the private use area. By "non-language characters", do you mean characters that are not typically used, like private use area code points? Or any symbols, like "☃"? What about "→"? That's useful in written text.
– deceze ♦, Commented Jan 25, 2012 at 11:32
Yes, I want to remove all symbols and other characters that are not typically typed on a regular keyboard. I currently keep A-Z, but I need the same for all languages.
– motioz, Commented Jan 25, 2012 at 11:35
How far do you want to go for "text"? There are giant sections for lots of typography-related things, which are arguably language related. What's the primary goal/reason for this?
– deceze ♦, Commented Jan 25, 2012 at 11:36

Tim Pietzcker · Accepted Answer · 2012-01-25 11:38:43Z

11

No regex will be perfect for what you want; language and writing are just too complex for this. But an approximation could be:

preg_replace('/[^\p{L}\p{M}\p{Z}\p{N}\p{P}]/u', ' ', $text);

This replaces with a space every character that does not belong to one of the Unicode general categories “letter”, “mark”, “separator”, “number” or “punctuation”.
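As a rough illustration of how that pattern behaves, the sketch below wraps it in a function and runs it on a mixed-script string. The function name `keep_language_chars` and the sample text are made up for this example, and it assumes PHP 7 or later (for the `\u{...}` escape and type declarations). Note that tabs and newlines are control characters (category `Cc`), not separators, so this pattern turns them into spaces too, and symbols such as "→" fall under the `S` categories and are replaced as well.

```php
<?php
// Hypothetical helper; only the regex itself comes from the answer.
function keep_language_chars(string $text): string
{
    // The /u modifier makes PCRE treat pattern and subject as UTF-8.
    $result = preg_replace('/[^\p{L}\p{M}\p{Z}\p{N}\p{P}]/u', ' ', $text);

    // preg_replace() returns null on failure, e.g. for invalid UTF-8 input.
    return $result ?? '';
}

// U+E000 is a private use code point; "→" is a math symbol (category Sm).
// Both are replaced by a space, while the letters, digits and punctuation stay.
$sample = "Привет, мир! \u{E000} שלום → 123";
echo keep_language_chars($sample), PHP_EOL;
```

If line breaks should survive, adding `\s` inside the negated character class is one possible adjustment.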


Terry Lin · Accepted Answer · 2015-05-16 20:05:37Z

1

Tim Pietzcker's answer did not work in my case.

This works:

$after = preg_replace('/[^\w\s]+/u','' , $before);
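A small sketch of what this pattern does in practice, assuming PHP 7 or later and a PHP build where the `/u` modifier also makes `\w` and `\s` Unicode-aware (which is my understanding of current PHP versions). The function name `strip_non_word_chars` is invented for the example. Unlike the pattern in the other answer, this one deletes punctuation as well as symbols, and it removes the characters outright instead of replacing them with a space.

```php
<?php
// Hypothetical helper; only the regex itself comes from the answer.
function strip_non_word_chars(string $text): string
{
    // With /u, \w covers Unicode letters, digits and "_", and \s covers whitespace.
    $result = preg_replace('/[^\w\s]+/u', '', $text);

    // preg_replace() returns null on failure, e.g. for invalid UTF-8 input.
    return $result ?? '';
}

// The comma, the "!" and the snowman symbol are all removed; the spaces stay.
echo strip_non_word_chars("Привет, мир! ☃"), PHP_EOL;
```

Whether losing punctuation is acceptable depends on what the cleaned text is used for.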
