1. Field of the Invention
This invention relates to the method of collating words using phonetic symbols, to produce word lists in which the listed words' articulations of syllables and phonemes are arranged sequentially as close to each other phonetically as possible.
2. Background Art
Beginning several centuries ago, publishers have produced and sold dictionaries to the public. These dictionaries placed the word entries in alphabetic order and provided information: definition, grammar, pronunciation, origin. The dictionaries' primary purpose has been to aid in selecting the best word for the thought with its correct spelling.
Dictionaries are a great aid in the effort to pronounce new words or to correct speaking errors. The way dictionaries do this is to provide phonic symbols for each sound used in the word. In simple languages like Spanish the spelling symbols—the alphabet—matches the phonic symbols. In complex languages such as English the spelling symbols number 26 while the language uses a minimum of 43 phonic symbols to represent the individual sounds. Some languages over the centuries have increased their spelling symbols such as the Danish which went from 26 to 29.
Dictionaries' words are arranged in alphabetic order because the books are used primarily as an aid to written language usage as opposed to the auditory language usage. Therefore the dictionaries word order is based on the use of spelling symbols and not phonic symbols, and this leads to two problems. Because in some languages the sheer number of possible spellings of individual phonic sounds making it difficult to spell a word correctly in the situation where one has never seen the word before and yet has to write the word based on its pronunciation. In vowel transposition in English for example, the short vowel sound i has ten spelling forms: i, e, o, u, y, ee, e, e i., ie, and ui. Another difficult area when looking up a word, to one with limited visual experience in English, is the silent first letter of a word like the “k” in know, or the “p” in psychiatry. Needless to say, without knowledge of the spelling of “know” or “psychiatry” one would find it hard to locate these words in a present day dictionary.
Presently, with use of computers, digital machines, computer programs and software, society has developed digital dictionaries and digital spell checkers. Together, these developments have provided the data bases or large lexicons and the means to mine the data for spelling options. In doing so, methods have been made to sort through the letter-sound correspondence of a given language orthography. Yet again, the methods center on finding the most probable spelling correction. My invention provides the means to list words by a languages' pronunciation. Printing word listings by pronunciations has the advantage of arranging words so one can understand the cause and effect-relationships—of individual sounds, combinations of sounds, blended sounds, syllable sounds and word sounds to the written word. In English for example, the dynamic of approximately 40 sounds represented and shared by 26 writing characters, with those characters and character combinations making up to nearly 400 graphic representations of the 40 sounds is complex orthographic interplay of phonemes and graphemes to say the least.
This invention is the method of collating words of Indo-European languages by using a special mark for the space between syllables and the languages' phonetic symbols, to produce word lists in which the listed words' articulations of syllables and phonemes are arranged sequentially as close to each other phonetically as possible. While most word list are arranged alphabetic based on an order of letters and the number of letters used in the languages' spelling system, this method would arrange word list based on an order of marks & symbols, of the number of unique sounds used in the languages' pronunciation system. The stark contrast is the alphabetic system results in word list with words arranged as close to each other spelling wise as can be, however this invention's method results in word list with word arranged as close to each other sound wise as can be.
In using my design invention an English dictionary for an example would organize words collating and listing them by phonetic symbols, dividing them into about 43 chapter one for each English sound instead of the usual dictionary format of a chapter by each letter. Furthermore, each chapter could be subdivided, according to words beginning with one of the 18 vowel sounds in the vowel chapters and the next sub-chapters would be the listing of any of the possible 25 accompanying consonants in the second character position. Likewise, in the 25 consonant chapters, the sub-chapter could number 18, one for each accompanying vowel. In a number of many cases there would be no first word existing for some of those sub-chapter phoneme combinations. See
To locate a word one would first focus on a word's first two sounds narrowing the search to one of the 43 chapters, and then narrow it again to either 18 or 25 sub-chapters. The point is that within those sub-chapters one will find the closest like sound words grouped together, and thus come upon a vast expanded pronunciation key. Where the typical pronunciation key in a dictionary has one to three examples of words using a particular phoneme, this invention would maximize the listing examples to the limit of the words in the total data base. A modest 30,000 word list will generate many more than three examples of words beginning with phonemes äk. (See
So to improve the auditory learning and spelling of specifically Indo-European languages I have developed a method of collating words, which insures the closest ordering possible of like sounding words (and syllables) for a given language and given number of words. The method of collating words is done by comparing the words' phonic symbols and syllabication.
The Method of Collating Words With Phonetic Symbols
The first step, is to assign phonetic symbols to represent each sound of a language that one would hear or distinguish. See
Step two, is to assign a rank and a weight to the chosen mark and phonetic symbols in this order: (1) the special mark representing the space between syllables; (2) the vowel, diphthong sounds and semi-vowels; (3) consonant sounds. An example for the English language would be: {hacek over (a)},
Step three, is to gather together the number of words of a single language, in typographical form into a data base, that one wishes to place in phonetic word order, with both the individual words', phonetic and orthographic version paired together. See
Step four, involves collating all the words' phonetic symbols either by: (1) hand collating by human intuition comparing by ranking; (2) machine collating with the aid of a computer program that uses a string comparison function with variable mark and phonetic symbols weights. Collating begins by comparing two words' phonetic symbols. One first does so by comparing the string of phonetic symbols with the same position in each word, beginning with the first phonetic symbol position on the left side in the words. In comparing, one is looking for the first difference or first non-matching sequential phonetic symbols. Once the difference of phonetic symbols is found the ranking or phonetic symbol's weight of those mismatched symbols determines the order between the words, regardless of the remaining symbols to the right-hand side. See
Step five, is to display the results of the collating either with an electronic machine display, printed version or published manuscript. For an example see