I need to recognize and reformat email addresses (and other similar data formats) in a text file. The file is laid out in 2 columns separated by a tab character, and a single data item may span multiple lines within one column. The last lines of each 'record' may or may not be present; when they are, they span both columns.
Example:
I am able to recognize the email address by using the UltraEdit regular expression [^.^-_0-9a-z]+^@[^.^-_0-9a-z^p]+.[a-z]+, even over multiple lines if there are no columns. But I'm curious if I can do a search over multiple lines in a specific column. And if possible, how do I eliminate the tab or CR/LF in the email address string?
A straightforward approach would be to first reformat the file into one column (for example, by moving the right column under the last line of the left column). The difficulty is that the last line of a record may contain free text, which makes it hard to tell where the left column ends.
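To make the one-column idea concrete, here is a rough sketch of it outside UltraEdit, in Python. It is only an illustration under assumptions: the sample text, the names split_columns and find_emails, and the case-insensitive matching are all made up for this post, and the pattern is my approximate translation of the UltraEdit expression above. It splits every line at the first tab, rebuilds each column as its own block of text, and then removes line breaks inside any address it finds.

```python
import re

# Hypothetical sample: two tab-separated columns; the address in the
# left column is broken across two lines.
SAMPLE = (
    "Contact: art@exam\tPhone:\n"
    "ple.com\t555 0100\n"
)

# Approximate Python translation of the UltraEdit pattern above. A line break
# is allowed only in the part after the @, as in the original expression.
# Case-insensitive matching is an assumption.
EMAIL = re.compile(r"[.\-_0-9a-z]+@[.\-_0-9a-z\n]+\.[a-z]+", re.IGNORECASE)


def split_columns(text):
    """Return (left, right) column text, one cell per line in each."""
    left, right = [], []
    for line in text.splitlines():
        # Split at the first tab only; a line without a tab lands in the left column.
        cell_left, _, cell_right = line.partition("\t")
        left.append(cell_left)
        right.append(cell_right)
    return "\n".join(left), "\n".join(right)


def find_emails(column_text):
    """Find addresses in one column and drop the line breaks inside them."""
    return [m.group(0).replace("\n", "") for m in EMAIL.finditer(column_text)]


left, right = split_columns(SAMPLE)
print(find_emails(left))
print(find_emails(right))
```

This does not solve the problem with the last lines of a record: a line without a tab simply ends up in the left column, so free text that spans both columns is still not recognized as such. The pattern is also greedy, so text that follows an address on the next line of the same column could be swallowed into the match.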
Anybody have an idea?
Art