Claims
- 1. A method of condensing text comprising the steps of:
determining at least one text structure for a text; determining a parsing grammar, a generation grammar and transformation rules; determining a packed structure based on the determined text structures; determining a reduced packed structure based on the transformations of the packed structure; determining candidate structures based on the reduced packed structure and a disambiguation model; determining grammatical condensed text structures based the generation grammar and the candidate structures.
- 2. The method of claim 1, wherein the parsing grammar is at least one of:
a lexical functional grammar, a phrase structure grammar and a dependency grammar.
- 3. The method of claim 1, wherein the generation grammar is at least one of: a lexical functional grammar, a phrase structure grammar and a dependency grammar.
- 4. The method of claim 1, wherein the transformation rules perform at least one of: adding, deleting, modifying and linguistically transforming.
- 5. The method of claim 1, wherein the candidate structures are determined based on at least one of: a statistical, a semantic, a syntactic and a lexical disambiguation.
- 6. The method of claim 1, wherein the packed structure is at least one of:
a lexical functional grammar, a phrase structure grammar and a dependency grammar.
- 7. The method of claim 1, wherein the transformation rules operate on elements of the packed structure.
- 8. The method of claim 7, wherein the elements are linguistic facts and associated linguistic context.
- 9. The method of claim 8, where the transformations rules are applied directly to the linguistic facts of the packed structure.
- 10. The method of claim 7, wherein the step of applying the transformation rules to the packed structure further comprises determining linguistic facts and context of the packed structure and applying the transformation rules to the determined linguistic facts and context.
- 11. A system of condensing text comprising:
an input/output circuit for receiving a text; a memory; a processor that determines text structures for the text; a parsing grammar circuit that determines a parsing grammar; a packed structure circuit that determines a packed structure for each text structure based on the parsing grammar; a reduced structure circuit that determines reduced structures for each packed structure based on at least one transformation rule; a candidate structure circuit that determines candidate structures based on the reduced structures and a disambiguation model; a generation grammar circuit that determines a generation grammar; and a grammatical condensed text structure circuit that determines a grammatical condensed text structure based on the determined generation grammar and the candidate structures.
- 12. The system of claim 11, wherein the parsing grammar circuit determines a parsing grammar from at least one of: a lexical functional grammar, a phrase structure grammar and a dependency grammar.
- 13. The system of claim 11, wherein the generation grammar circuit determines a generation grammar from at least one of: a lexical functional grammar, a phrase structure grammar and a dependency grammar.
- 14. The system of claim 11, wherein the transformation rules perform are at least one of: adding, deleting, modifying and linguistically transforming.
- 15. The system of claim 11, wherein the candidate structures circuit determines candidate structures based on at least one of: a statistical, a semantic, a syntactic and a lexical disambiguation.
- 16. The system of claim 11, wherein the packed structure circuit determines packed structures based on at least one of: a text structure, a sentencial structure, a paragraph structure.
- 17. The system of claim 11, wherein the processor applies transformation rules to elements of the packed structure.
- 18. The system of claim 17, wherein the elements of the packed structure are linguistic facts and linguistic context.
- 19. The system of claim 18, where the transformations rules are applied directly to the linguistic facts and linguistic context of the packed structure.
- 20. The system of claim 17, wherein the processor unpacks the linguistic facts and linguistic context of the packed structure and applies the transformation rules to the determined linguistic facts and linguistic context.
- 21. Computer readable storage medium comprising: computer readable program code embodied on the computer readable storage medium, the computer readable program code usable to program a computer for grammatical text condensation comprising the steps of:
determining at least one text structure for a text; determining a parsing grammar, a generation grammar and transformation rules; determining a packed structure based on the determined text structures; determining a reduced packed structure based on the transformations of the packed structure; determining candidate structures based on the reduced packed structure and a disambiguation model; determining grammatical condensed text structures based the generation grammar and the candidate structures.
- 22. A carrier wave encoded to transmit a control program, useable to program a computer for grammatical text condensation, to a device for executing the program, the control program comprising:
instructions determining at least one text structure for a text; instructions determining a parsing grammar, a generation grammar and transformation rules; instructions determining a packed structure based on the determined text structures; instructions determining a reduced packed structure based on the transformations of the packed structure; instructions determining candidate structures based on the reduced packed structure and a disambiguation model; instructions determining grammatical condensed text structures based the generation grammar and the candidate structures.
- 23. The method of claim 1, wherein the text structure is at least one of a sentence, a paragraph, selected portions of a text and a discourse.
- 24. The method of claim 23, wherein text structures are selected based on at least one of a discourse grammar and a statistical model.
- 25. The system of claim 11, wherein the text structure is at least one of a sentence, a paragraph, selected portions of a text and a discourse.
- 26. The system of claim 23, wherein text structures are selected based on at least one of a discourse grammar and a statistical model.
INCORPORATION BY REFERENCE
[0001] This Application incorporates by reference:
[0002] U.S. patent application Ser. No. 10/338,846, entitled “SYSTEMS AND METHODS FOR EFFICIENT CONJUNCTION OF BOOLEAN VARIABLES” by Maxwell III, John T., filed Jan. 9, 2003;
[0003] U.S. patent application Ser. No. 09/883,345, entitled “SYSTEM AND METHOD FOR GENERATING ANALYTIC SUMARIES” by Polanyi et al., filed Jun. 19, 2001;
[0004] U.S. patent application Ser. No. 09/689,779, entitled “SYSTEM AND METHOD FOR GENERATING TEXT SUMMARIES” by Polanyi et al.;
[0005] U.S. Pat. No. 5,778,397, entitled “AUTOMATIC METHOD OF GENERATING FEATURE PROBABILITIES FOR AUTOMATIC EXTRACTING SUMMARIZATION” by Kupiec et al., filed Jun. 28, 1995;
[0006] U.S. Pat. No. 5,918,240, entitled “AUTOMATIC METHOD OF EXTRACTING SUMMARIZATION USING FEATURE PROBABILITIES” by Kupiec et al., filed Jun. 28, 1995;
[0007] U.S. Pat. No. 5,689,716 entitled “AUTOMATIC METHOD OF GENERATING THEMATIC SUMMARIES” by Chen et al., filed Apr. 14, 1995; and
[0008] U.S. Pat. No. 5,745,602, entitled “AUTOMATIC METHOD OF SELECTING MULTI-WORD KEYPHRASES FROM A DOCUMENT” by Chen et al., filed May 1, 1995; each in their entirety.