I'm trying to find out whether there are dot, comma, semicolon or colon just before the closing tag </title> in my file except when there is a semi-colon which does not represent a 4 digit hex entity.
Text sample
The search pattern should positively match:
I tried this Perl regular expression search string: ([.,:]|((?<!&#x[0-9a-z][0-9a-z][0-9a-z][0-9a-z]);))</title>
Can someone help me make it shorter.
Text sample
Code: Select all
<title>Dhind</title>
<title>WT.</title>
<title>Plant Leaves:</title>
<title>Denia;</title>
<title>Erodé</title>
Code: Select all
<title>WT.</title>
<title>Plant Leaves:</title>
<title>Denia;</title>
Can someone help me make it shorter.