- Posted by
- I have a whole set of regexes given below which I want to run all together instead of running each one separately. These allow me to check for mismatches within a dictionary and ensure that a wrong word is not mapped to the headword entry. I tried to write a macro with each regex listed and with a l...Posted in Macros
-   Topics
-   Views
- dictdoc
Jan 04, 2014
- Dear Mofi, I tried the script out with and without the commented lines. It worked just great. At present my data is in English so there are no issues. I tried it on a small file in Unicode following your suggestions and it worked just fine. Many thanks. Schoene Weihnachten und Alles Gutes fuer das N...Posted in Scripts
-   Topics
-   Views
- dictdoc
Dec 22, 2013
- My problem is as described below. I am working on name variants and one of the tools developed in C is a Metaphone Engine. Basically the engine looks for name variants and conjoins them to provide similar homographs. The engine is designed for Indian languages but the examples given below are in Eng...Posted in Scripts
-   Topics
-   Views
- dictdoc
Dec 20, 2013
- Hello, I am trying to sort a file in Urdu by the character by which it ends. Each word is on a separate line An example in English would help: lychee fruit banana apple pear cherry I need the sort to be on the last letter of the word and then sorted recursively in reverse order banana lychee [i]This...Posted in Macros
-   Topics
-   Views
- dictdoc
Oct 07, 2013
- I am working on an Urdu to Hindi dictionary and I have created the following file structure: Headword=Gloss1,Gloss2,Gloss3 i.e. glosses delimited by a comma. It so happens that in some cases (around 6000+ in a file of over 200,000+ the glosses are duplicated. Since this may be a recurrent phenomenon...Posted in Scripts
-   Topics
-   Views
- dictdoc
Aug 18, 2013
- Sorry for the late response. I was hospitalised for a month and had no access to the internet. I have just got back and checked the completed script and it works very well. Thank you for your kind help and once again my excuses for this late response.Posted in Scripts
-   Topics
-   Views
- dictdoc
Jul 11, 2013
- Hello, I am compiling an open-source stemmer dictionary for English and eventually for other Indian languages. The Engine which I have written has spewed out all lemmatised/expanded forms of the words: Nouns, Adjectives, Adverbs etc. Each set of expanded forms is separated by a hard return. Since ea...Posted in Scripts
-   Topics
-   Views
- dictdoc
May 24, 2013
- Hello, I have a file which has the following structure word space frequency The file is around 30,000 headwords each along with its frequency. The words have different lengths. What I need is a script which can sort the file on length of the headword and once the file is sorted on length: smallest t...Posted in Scripts
-   Topics
-   Views
- dictdoc
Mar 22, 2013
- Hello, Splitting a sentence using the full-stop/question-mark/exclamation is a common device. Whereas the question-mark / exclamation do not pose too much of a problem; the full-stop as a sentence delimiter raises certain issues because of its varied use, as shown in the examples below: The temperat...Posted in Find/Replace/Regular Expressions
-   Topics
-   Views
- dictdoc
Jul 29, 2012
- Dear all, I am working with names and I have a large file of names in which some words are written together (upto 4 or 5) and their corresponding single forms are also present in the word-list. An example would make this clear annamarie mariechristine johnsmith johnjoseph smith john smith anna marie...Posted in Scripts
-   Topics
-   Views
- dictdoc
May 02, 2012
- Hello, I have a database with the following structure: Headword in a foreign language followed by frequency (delimited with comma) and eventually followed by a list of glosses in English, each gloss delimited with a comma. A small sample is given below कुर्यवंशी,4,kuryanshi,kuryavanashi,kuryawanshi,...Posted in Macros
-   Topics
-   Views
- dictdoc
Apr 29, 2012