Dear All,
I want to find the line that was not in sort order.
I have converted some pdf file into xml. Now I have to find out the split lines. The file was sorted by either Name of the author or numbered list.
If one sentence was split into two, the second line must have a great chance to be not in alphabetical order and I need a solution to find it.
Eg.,
Here I want to find the 3rd line because it was split from previous line.
And another example.
Here I have to find out 4th and 6th line. The file have thousands of lines. Is there any workaround to find this.
Thanks in advance.
Arun
I want to find the line that was not in sort order.
I have converted some pdf file into xml. Now I have to find out the split lines. The file was sorted by either Name of the author or numbered list.
If one sentence was split into two, the second line must have a great chance to be not in alphabetical order and I need a solution to find it.
Eg.,
Code: Select all
1. author 1 sample text 1920. England.
2. Arun sample text
1951. France.
3. Kumar sample text 1854 America.
And another example.
Code: Select all
Arun sample text 1951. France.
Author1 sample text. 1920. England.
Author5 sample text
1920. England.
Kumar sample text 1854.
America.
Mofi sample text 2014, Austria.
Thanks in advance.
Arun