I am working on an Urdu to Hindi dictionary and I have created the following file structure:
It so happens that in some cases (around 6000+ in a file of over 200,000+ the glosses are duplicated.
Since this may be a recurrent phenomenon, could a macro or a script be deployed which could check the glosses on the right hand side and if there are duplicates, remove the same and maintain only a single gloss.
An example will make this clear:
Input
Th expected output would be
In case live data is need here is a sample:
I have a ver. 15.20 of Ultraedit.
Many thanks for a macro or a script.
Code: Select all
Headword=Gloss1,Gloss2,Gloss3
i.e. glosses delimited by a comma.
Since this may be a recurrent phenomenon, could a macro or a script be deployed which could check the glosses on the right hand side and if there are duplicates, remove the same and maintain only a single gloss.
An example will make this clear:
Input
Code: Select all
a=b,c,b
d=p,q,p
e=z,y,g,z,g,y
Code: Select all
a=b,c
d=p,q
e=g,y,z
Code: Select all
آبادِیوں=आबादिओं,आबादियों
آبادی=जनसंख्या,आबादी
آبجیکشن=ऑबजेक्शन,ऑब्जेक्शन
آبلا=अबला,उबला
آبو=आबू,आबो
آتشک=आतशक,आतिशक
آتم=आतम,आतम,आत्म,आत्म
آتون=आतून,आतोन
آتیں=आतीं,आतें,आतें,आतीं
آجا=आ जा,आजा
آجاتی=आ जाती,आजाती
آجانا=आ जाना,आजाना
آجکل=आज कल,आजकल
آخری=अंतिम,आख़री
آد=आद,आद,आदि
Many thanks for a macro or a script.