METHOD AND ELECTRONIC DEVICE FOR PROCESSING USER UTTERANCE BASED ON AUGMENTED SENTENCE CANDIDATES

Information

  • Patent Application
  • 20230297786
  • Publication Number
    20230297786
  • Date Filed
    March 20, 2023
    a year ago
  • Date Published
    September 21, 2023
    a year ago
  • CPC
    • G06F40/40
    • G06F40/20
  • International Classifications
    • G06F40/40
    • G06F40/20
Abstract
An electronic device is provided. The electronic device includes a memory including instructions and a processor electrically connected to the memory and executing the instructions, in which, when the instructions are executed by the processor, the processor obtains intent information corresponding to an utterance of a user, calculates a reliability of the intent information, generates one or more augmented sentence candidates including at least a portion of the utterance based on one or more language models in response to the reliability being less than a threshold value, and provides a response to the user based on the augmented sentence candidates.
Description
Claims
  • 1. An electronic device, comprising: a memory comprising instructions; andat least one processor electrically connected to the memory and configured to execute the instructions,wherein, when the instructions are executed by the at least one processor, the at least one processor is configured to: obtain intent information corresponding to an utterance of a user,calculate a reliability of the intent information,in response to the reliability being less than a threshold value, generate one or more augmented sentence candidates comprising at least a portion of the utterance based on one or more language models, andprovide a response to the user based on the augmented sentence candidates.
  • 2. The electronic device of claim 1, wherein the reliability is less than the threshold value when the utterance includes only a noun or a predicate.
  • 3. The electronic device of claim 1, wherein the reliability is less than the threshold value when the utterance includes a homonym.
  • 4. The electronic device of claim 1, wherein the one or more language models comprise at least one of: a first language model which is a basic language model,a second language model generated based on learning data for each domain,a third language model generated based on a user history, ora fourth language model generated with a social trend reflected therein.
  • 5. The electronic device of claim 4, wherein the one or more language models comprise: a fifth language model generated by a combination of two or more language models among the first language model, the second language model, the third language model, and the fourth language model.
  • 6. The electronic device of claim 1, wherein a reliability of intent information corresponding to each of the one or more augmented sentence candidates is greater than the threshold value.
  • 7. The electronic device of claim 1, wherein the at least one processor is configured to: obtain information associated with the utterance,preprocess an utterance obtained through a conversion into a form of text based on the information, andgenerate the one or more augmented sentence candidates based on a result of the preprocessing and the one or more language models.
  • 8. The electronic device of claim 1, wherein the at least one processor is further configured to: output a question generated based on a plurality of augmented sentence candidates.
  • 9. The electronic device of claim 1, wherein the at least one processor is further configured to: select at least one augmented sentence from among the one or more augmented sentence candidates, andoutput the response generated based on the at least one augmented sentence.
  • 10. The electronic device of claim 9, wherein the at least one processor is further configured to: for each of the one or more augmented sentence candidates, select the at least one augmented sentence by verifying a confidence value, verifying an uncertainty value, verifying an augmented history, or verifying a checklist.
  • 11. An operation method of an electronic device, comprising: obtaining intent information corresponding to an utterance of a user;calculating a reliability of the intent information;in response to the reliability being less than a threshold value, generating one or more augmented sentence candidates comprising at least a portion of the utterance based on one or more language models; andproviding a response to the user based on the augmented sentence candidates.
  • 12. The operation method of claim 11, wherein the reliability is less than the threshold value when the utterance includes only a noun or a predicate.
  • 13. The operation method of claim 11, wherein the reliability is less than the threshold value when the utterance includes a homonym.
  • 14. The operation method of claim 11, wherein the one or more language models comprise at least one of: a first language model which is a basic language model;a second language model generated based on learning data for each domain;a third language model generated based on a user history; ora fourth language model generated with a social trend reflected therein.
  • 15. The operation method of claim 14, wherein the one or more language models comprise: a fifth language model generated by a combination of two or more language models among the first language model, the second language model, the third language model, and the fourth language model.
  • 16. The operation method of claim 11, wherein a reliability of intent information corresponding to each of the one or more augmented sentence candidates is greater than the threshold value.
  • 17. The operation method of claim 11, wherein the generating comprises: obtaining information associated with the utterance;preprocessing an utterance obtained through a conversion into a form of text based on the information; andgenerating the one or more augmented sentence candidates based on a result of the preprocessing and the one or more language models.
  • 18. The operation method of claim 11, wherein the providing of the response comprises: outputting a question generated based on a plurality of augmented sentence candidates.
  • 19. The operation method of claim 11, wherein the providing comprises: selecting at least one augmented sentence from among the one or more augmented sentence candidates; andoutputting the response generated based on the at least one augmented sentence.
  • 20. The operation method of claim 19, wherein, for each of the one or more augmented sentence candidates, the selecting comprises at least one of: verifying a confidence value;verifying an uncertainty value;verifying an augmented history; orverifying a checklist.
Priority Claims (2)
Number Date Country Kind
10-2022-0034180 Mar 2022 KR national
10-2022-0057653 May 2022 KR national
Continuations (1)
Number Date Country
Parent PCT/KR2022/020837 Dec 2022 WO
Child 18186511 US