This application is based on and claims priority under 35 U.S.C. § 119 of an Indian patent application number 201941036011, filed on Sep. 6, 2019, in the Indian Intellectual Property Office, the disclosure of which is incorporated by reference herein in its entirety.
The disclosure relates to performing actions in an electronic device. More particularly, the disclosure relates to a method and an electronic device for generating at least one data item from a screenshot.
In general, screenshots provide a view of a user interface of an electronic device (e.g., a view of an application window or a view of a computer desktop). When a user initiates a capture of a screenshot, the electronic device will create an image of a current state of the user interface of the electronic device. The user can then use the screenshot in a body of a document, share the screenshot etc.
However, the user may generally want to share only a portion of the screenshot and not the entire screenshot with another user or the user intends to use only the portion of the screenshot in the documents. In order to be able to use only the portion of the screenshot, the user will have to first capture the screenshot (e.g., as shown in screenshot 1 of
The above information is presented as background information only to assist with an understanding the disclosure. No determination has been made, and no assertion is made, as to whether any of the above might be applicable as prior art with regard to the disclosure.
Aspects of the disclosure are to address at least the above-mentioned problems and/or disadvantages and to provide at least the advantages described below. Accordingly, an aspect of the disclosure is to provide a method for generating at least one data item from a screenshot.
Another aspect of the disclosure is to determine a plurality of data item parameters associated with content in the screenshot.
Another aspect of the disclosure is to generate at least one data item from the screenshot based on the plurality of data item parameters.
Another aspect of the disclosure is to store the at least one data item in a file in the electronic device.
Another aspect of the disclosure is to perform at least one action in the electronic device using the at least one data item.
Another aspect of the disclosure is to determine whether the content displayed in a first image is complete based on a plurality of attributes.
Another aspect of the disclosure is to automatically scroll the content in at least one direction in response to determining that the content displayed in the first image is not complete, to capture the complete content in the screenshot.
Additional aspects will be set forth in part in the description which follows and, in part, will be apparent from the description, or may be learned by practice of the presented embodiments.
The method includes detecting, by the electronic device, a screenshot comprising content displayed on a screen of the electronic device and determining, by the electronic device, a plurality of data item parameters associated with the content in the screenshot. Further, the method also includes generating, by the electronic device, at least one data item from the screenshot based on the plurality of data item parameters, and storing, by the electronic device (100), the at least one data item in a file in the electronic device.
In an embodiment, the method further includes performing, by the electronic device, at least one action in the electronic device using the at least one data item.
In an embodiment, the method for detecting, by the electronic device, the screenshot comprising the content displayed on a screen of the electronic device includes receiving a first image of the content displayed on a screen of the electronic device and determining a plurality of attributes associated with the content displayed in the first image of the screen of the electronic device. Further, the method also includes determining whether the content displayed in the first image is complete based on the plurality of attributes and displaying an indication to scroll the content in at least one direction in response to determining that the content displayed in the first image is not complete. Furthermore, the method also includes receiving at least one second image of a scrolled content displayed on the screen of the electronic device, and detecting the screenshot based on the first image and the at least one second image.
In an embodiment, the method for generating, by the electronic device, the at least one data item from the screenshot based on the plurality of data item parameters includes displaying a plurality of objects available in the screenshot based on the plurality of data item parameters and detecting at least two objects from the plurality of objects selected by a user. Further, the method also includes selecting the at least two objects and generating a data item by automatically merging the at least two selected objects.
In an embodiment, the method for generating, by the electronic device, at least one data item from the screenshot based on the plurality of data item parameters includes determining a context associated with a plurality of objects available in the screenshot based on the plurality of data item parameters and determining a weightage of each of the objects based on the context. Further, the method also includes determining a set of objects from the plurality of objects for which the weightage meets a data item threshold, and generating at least one data item by automatically merging the set of objects.
In an embodiment, the plurality of data item parameters includes an outline of the content, proximity of the objects of the content and attributes of the content.
In an embodiment, the outline of the content includes information about at least one of edges of objects available in the screenshot (e.g., screen shot), line segments of objects available in the screenshot, angle of objects available in the screenshot, or a screen size of objects available in the screenshot.
In an embodiment, the attributes of the content include metadata information of the objects available in the screenshot.
In an embodiment, the proximity of the objects of the content indicates a distance between the objects available in the screenshot.
In accordance with another aspect of the disclosure, an electronic device for generating at least one data item from a screenshot is provided. The electronic device includes an imaging sensor, a memory and at least one processor coupled to the memory. The at least one processor is configured to detect a screenshot comprising content displayed on a screen of the electronic device, determine a plurality of data item parameters associated with the content in the screenshot, generate at least one data item from the screenshot based on the plurality of data item parameters, and store the at least one data item in a file in the electronic device.
Other aspects, advantages, and salient features of the disclosure will become apparent to those skilled in the art from the following detailed description, which, taken in conjunction with the annexed drawings discloses various embodiments of the disclosure.
The above and other aspects, features, and advantages of certain embodiments of the disclosure will be more apparent from the following description taken in conjunction with the accompanying drawings, in which:
Throughout the drawings, it should be noted that like reference numbers are used to depict the same or similar elements, features, and structures.
The following description with reference to the accompanying drawings is provided to assist in a comprehensive understanding of various embodiments of the disclosure as defined by the claims and their equivalents. It includes various specific details to assist in that understanding but these are to be regarded as merely exemplary. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the various embodiments described herein can be made without departing from the scope and spirit of the disclosure. In addition, descriptions of well-known functions and constructions are omitted for clarity and conciseness.
The terms and words used in the following description and claims are not limited to the bibliographical meanings, but, are merely used by the inventor to enable a clear and consistent understanding of the disclosure. Accordingly, it should be apparent to those skilled in the art that the following description of various embodiments of the disclosure is provided for illustration purpose only and not for the purpose of limiting the disclosure as defined by the appended claims and their equivalents.
It is to be understood that the singular forms “a,” “an,” and “the” include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to “a component surface” includes reference to one or more of such surfaces.
As is traditional in the field, embodiments may be described and illustrated in terms of blocks which carry out a described function or functions. These blocks, which may be referred to herein as units, engines, manager, modules or the like, are physically implemented by analog and/or digital circuits such as logic gates, integrated circuits, microprocessors, microcontrollers, memory circuits, passive electronic components, active electronic components, optical components, hardwired circuits and the like, and may optionally be driven by firmware and/or software. The circuits may, for example, be embodied in one or more semiconductor chips, or on substrate supports such as printed circuit boards and the like. The circuits constituting a block may be implemented by dedicated hardware, or by a processor (e.g., one or more programmed microprocessors and associated circuitry), or by a combination of dedicated hardware to perform some functions of the block and a processor to perform other functions of the block. Each block of the embodiments may be physically separated into two or more interacting and discrete blocks without departing from the scope of the disclosure. Likewise, the blocks of the embodiments may be physically combined into more complex blocks without departing from the scope of the disclosure.
Referring to
In the methods and systems of the related art, when a user captures a screenshot of a user interface of an application and the user wants to share only a portion of the screenshot, then the user will have to manually crop the irrelevant portions of the screenshot and then share the relevant portion of the screenshot.
Unlike to the methods and systems of the related art, in the proposed method the electronic device (100) automatically determines at least one data item from the screenshot and provides the at least one data item to the user which can be stored, or shared, printed, etc., based on user requirement.
Unlike to the methods and systems of the related art, in the proposed method the electronic device (100) intelligently merges text block and an image block based on similarity of content and generates the at least one data item from the screenshot.
Unlike to the methods and systems of the related art, in the proposed method the user can drag and drop the screenshot comprising at least one data item onto an application and the electronic device (100) automatically determines a data item of the at least one data item which is associated with the application and performs the at least one action.
Referring now to the drawings, and more particularly to
Referring to
In an embodiment, the imaging sensor (120) is configured to capture the screenshot of a user interface (UI) of the electronic device (100).
In an embodiment, the processor (140) is configured to detect the screenshot comprising content displayed on a screen of the electronic device (100) and determine a plurality of data item parameters associated with the content in the screenshot. The content displayed on the screen of the electronic device (100) comprises at least one of an image of an article, a text block, advertisement content, spaces between the article and the text block, etc. Further, the processor (140) is also configured to generate at least one data item from the screenshot based on the plurality of data item parameters associated with the content in the screenshot. The data item from the screenshot is a combination comprising at least one of the image block and the text block. The data item parameters associated with the content includes a metadata associated with the content in the screenshot.
Further, the processor (140) is configured to store the at least one data item in a file in the electronic device (100). The at least one data item can be stored in any form based on the generated data item. For example, in case the at least one data item generated is in the form of the image, then the at least one data item is stored as the image. The at least one data item can be generated as the image, a document, a search query, etc., and hence stored accordingly.
Furthermore, the processor (140) is also configured to perform at least one action in the electronic device (100) using the at least one data item. The at least one action is one of sharing the at least one data item generated by the processor (140) using various application in the electronic device (100), creating a search query in the various application in the electronic device (100) using the various application in the electronic device (100) generated by the processor (140), auto-masking of sensitive information in the at least one data item generated by the processor (140) based on a learning of a user behavior, etc.
The processor (140) includes an object detection engine (142), a block completeness determination engine (144), a data item generation engine (146), and an action management engine (148).
The object detection engine (142) further includes a Deep Neural Network (DNN) (142a), an image processing engine (142b) and an object database (142c). The object detection engine (142) is configured to analyze the screenshot captured by the electronic device (100) and detect at least one object in the screenshot. The at least one object includes at least one of an image block and a text block. The exact functioning of the object detection engine (142) is explained in
The block completeness determination engine (144) is configured to determine whether the at least one of the image block and the text block is complete with respect to the content. In case of the image block, the block completeness determination engine (144) determines whether the image block is complete based on object attributes (e.g. metadata) and continuity of the image block. In case of the text block, the block completeness determination engine (144) determines whether the text block complete is based on grammatical and spatial completeness of the text block. In case the block completeness determination engine (144) determines that the at least one object is not complete, then the block completeness determination engine (144) displays an indication to scroll the content in at least one direction to obtain a complete screenshot of the at least one object.
In an embodiment, the data item generation engine (146) is configured to generate the at least one data item from the screenshot captured by the electronic device (100) after the at least one object is determined from the plurality of content in the screenshot and after the block completeness determination engine (144) provides the complete screenshot. The at least one data item may be generated using one of: a plurality data item parameter which includes an outline of the content, proximity of the objects of the content and attributes of the content; a weightage score associated with the image block and the text block of the screenshot; based on a selection of the at least one image block and the at least one text block manually by the user, etc. The data item generation engine (146) includes Sobel filter units (146a), line segment detector (146b), a text encoder (146c), an image encoder (146d), and a weightage determination engine (146e).
The Sobel filter units (146a) and the line segment detector (146b) are used to generate the at least one data item from the screenshot based on the outline of the content. The text encoder (146c) and the image encoder (146d) are used to generate the at least one data item from the screenshot based on the attributes of the content. The weightage determination engine (146e) is used to generate the at least one data item from the screenshot based on the weightage score associated with the image block and the text block of the screenshot which also includes the proximity of the objects of the content.
In an embodiment, the action management engine (148) is configured to perform the at least one action on the at least one data item generated by the data item generation engine (146). The at least one action may be one of sharing the at least one data item, printing the at least one data item, generating a search query using the at least one data item and populating search results, etc.
In an embodiment, the memory (160) can include non-volatile storage elements. Examples of such non-volatile storage elements may include magnetic hard discs, optical discs, floppy discs, flash memories, or forms of electrically programmable memories (EPROM) or electrically erasable and programmable (EEPROM) memories. In addition, the memory (160) may, in some examples, be considered a non-transitory storage medium. The term “non-transitory” may indicate that the storage medium is not embodied in a carrier wave or a propagated signal. However, the term “non-transitory” should not be interpreted that the memory (160) is non-movable. In some examples, the memory (160) is configured to store larger amounts of information than the memory (160). In certain examples, a non-transitory storage medium may store data that can, over time, change (e.g., in Random Access Memory (RAM) or cache).
In an embodiment, the display (180) is configured to display the screenshot on the screen of the electronic device (100). Further, the display (180) is also configured to display the at least one data item generated by the data item generation engine (146).
Although the
Referring to
Further, the templates are fed by the image processing engine (142b) to the DNN (142a) to be used for supervised training for performing object detection in the screenshot.
Referring to
In response to determining that the confidence score of at least one object determined in the screenshot is greater than the threshold of the confidence score, the image processing engine (142b) adds the at least one object to a set of at least one of image/text bounding boxes.
In response to determining that the confidence score of at least one object determined in the screenshot is greater than the threshold of the confidence score, the image processing engine (142b) performs various image processing techniques on the at least one object determined in the screenshot such as blurring, edge detection, contour detection, rectangle enclosing the contour and then adds the at least one object to a set of at least one of image/text bounding boxes.
Referring to
Further, in response to determining that the content displayed in the first image is not complete, the block completeness determination engine (144) displays an indication to scroll the content in at least one direction. The content can be scrolled by the user manually based on the displayed by the block completeness determination engine (144) or the electronic device (100) can also auto-scroll in at least one direction to obtain the complete content. The at least one direction comprises scrolling the content in any of the four directions i.e., scroll to obtain content on top portion of the screen, scroll to obtain content on bottom portion of the screen, scroll to obtain content on left portion of the screen and scroll to obtain content on right portion of the screen. Further, at operation 3 shown in
The complete content refers to a complete article or the complete paragraph of a document. For example, consider that in the first image of the screenshot only a portion of a house is displayed. Then, the second image is taken to obtain the complete image of the house displayed in the first image. Similarly, when the text block displayed in the first image is incomplete (determined based on the content), then the second image is obtained to obtain the complete paragraph of the text block displayed in the first image.
Referring to
At operation 2 shown in
Further, at operation 3 shown in
In response to determining that the segment corresponding to the horizontal line segments crosses the size threshold, the data item generation engine (146), at operation 5 shown in
In response to determining that the segment corresponding to the vertical line segments crosses the size threshold, the data item generation engine (146) recursively detects the horizontal line segment and the vertical line segment (multiple iterations from operation 1 shown in
Referring to
At operation 3 shown in
At operation 4 shown in
Referring to
At operation 4 shown in
At operation 4 shown in
Further, a series of parameters are considered for determining a weighted score by the weightage determination engine (146e) which is in turn used to determine the generation of the at least one data item based on the weighted score. The series of parameters considered for determining the weighted score by the weightage determination engine (146e) includes a size of the at least one object, a distance from the center, image completeness score (ics) which indicates a percentage of the image block being cut or a probability of the image block not being cut, an image relevance score (irs) which performs an image analysis to assign score to image based on relevance of the contents of the image block to the application on which the screenshot is taken, a text relevance score (trs) which is obtained based on an optical character recognition (OCR) and a text analysis to assign score for the text to every image based on the relationship, and a context score (cs) which is a score based on a chat history, back information, learning, etc.
Accordingly, the following equation may be used:
Weighted score(Bi)=w_size**ss(Bi)+w_dfc*dfc(Bi)+w_ics*ics(Bi)+w_irs*irs(Bi)+w_cs*cs(Bi)+w_trs*Σj=0n_text_reqtrs(Bi,tj) Equation 1
Further, the weightage determination engine (146e) of the electronic device (100) determines whether the weighted score meets a data item threshold and in response to determining that the weighted score meets the data item threshold, the at least one data item is generated by the electronic device (100).
Referring to
At operation 320, the electronic device (100) determines the plurality of data item parameters associated with the content in the screenshot. For example, in the electronic device (100) as illustrated in the
At operation 330, the electronic device (100) generates the at least one data item from the screenshot based on the plurality of data item parameters. For example, in the electronic device (100) as illustrated in the
At operation 340, the electronic device (100) stores the at least one data item in the file. For example, in the electronic device (100) as illustrated in the
At operation 350, the electronic device (100) performs the at least one action using the at least one data item. For example, in the electronic device (100) as illustrated in the
The various actions, acts, blocks, steps, or the like in the method may be performed in the order presented, in a different order or simultaneously. Further, in some embodiments, some of the actions, acts, blocks, steps, or the like may be omitted, added, modified, skipped, or the like without departing from the scope of the disclosure.
Referring to
At operation 312, the electronic device (100) determines the plurality of attributes associated with the content displayed in the first image of the screen. For example, in the electronic device (100) as illustrated in the
At operation 313, the electronic device (100) determines whether the content displayed in the first image is complete based on the plurality of attributes. For example, in the electronic device (100) as illustrated in the
At operation 314, the electronic device (100) displays the indication to scroll the content in the at least one direction in response to determining that the content displayed in the first image is not complete. For example, in the electronic device (100) as illustrated in the
At operation 315, the electronic device (100) receives the at least one second image of the scrolled content displayed on the screen. For example, in the electronic device (100) as illustrated in the
At operation 316, the electronic device (100) detects the screenshot based on the first image and the at least one second image. For example, in the electronic device (100) as illustrated in the
Referring to
At operation 334a the electronic device (100) detects the at least two objects from the plurality of objects selected by the user. For example, in the electronic device (100) as illustrated in the
At operation 336a the electronic device (100) generates the data item by automatically merging the at least two selected objects. For example, in the electronic device (100) as illustrated in the
Referring to
At operation 334b the electronic device (100) determines the weightage of each of the objects based on the context. For example, in the electronic device (100) as illustrated in the
At operation 336b the electronic device (100) determines the set of objects from the plurality of objects for which the weightage meets the data item threshold. For example, in the electronic device (100) as illustrated in the
At operation 338b the electronic device (100) generates the at least one data item by automatically merging the set of objects. For example, in the electronic device (100) as illustrated in the
Referring to
At operation 352, the electronic device (100) allows the user to drop the screenshot on the application/service. For example, in the electronic device (100) as illustrated in the
At operation 353, the electronic device (100) gets the application/service data and metadata associated with the application. For example, in the electronic device (100) as illustrated in the
At operation 354, the electronic device (100) finds the at least one data item related to the service/application. For example, in the electronic device (100) as illustrated in the
At operation 355, the electronic device (100) determines whether the appropriate data item is provided to the application/service. For example, in the electronic device (100) as illustrated in the
At operation 356, in response to determining that the appropriate data item is not provided to the application/service, the electronic device (100) allows the user to select another data item.
At operation 357, in response to determining that the appropriate data item is provided to the application/service, the electronic device (100) executes the at least one action based on the application/service into which the screenshot is dropped by the user. For example, in the electronic device (100) as illustrated in the
Referring to
Shown at screenshot 2 of
Referring to
At screenshot 506a of
Referring to
Referring to
At screenshot 604a, the electronic device (100) determines the plurality of attributes associated with the content displayed in the first image such as the color and size of the products displayed, continuity of the product and any other metadata associated with the products displayed in the first image. Further, the electronic device (100) determines that the radio displayed in the first image is not complete based on the plurality of attributes and displays an indication to scroll-up the content to capture the complete radio. The scroll-up can be performed by one of automatically by the electronic device (100) and manually by the user.
At screenshot 606a, the electronic device (100) receives the second image of the radio displayed on the screen of the electronic device (100) along with the other products and generates a complete screenshot based on the first image and the second image. Further, the electronic device (100) generates the at least one data item from the complete screenshot on which the user can perform the at least one action.
Referring to
Referring to
Furthermore, the electronic device (100) determines the properties of the electronic device (100) such as length in pixels, etc., and performs the auto-scroll down of the content in the first image to complete the second text block. At screenshot 604c, the electronic device (100) captures the second image comprising the complete second text block after scrolling down the content in the first image. Further, the electronic device (100) merges the first image and the second image of the second text block and then determines whether the text/image block displayed at the bottom most portion of the screenshot is grammatically and spatially complete.
At screenshot 606c, in response to determining that the text/image block displayed at the bottom most portion of the screenshot is grammatically and spatially complete, the electronic device (100) generates the complete screenshot by merging the first image and the second image at the region of intersection of the first image and the second image. Further, the electronic device (100) determines the data items in the complete screenshot on which the at least one action can be performed.
In response to determining that the text/image block displayed at the bottom most portion of the screenshot is grammatically and spatially incomplete, the electronic device (100) scrolls down further to capture the complete text/image block and then forms the complete screenshot by merging the first image with the at least one second image.
Referring to
At screenshot 704, the electronic device (100) determines each of the text blocks and the image blocks from the contents of the screenshot. At screenshot 706, the electronic device (100) identifies the outlines already present in the screenshot. Further, at screenshot 708, the electronic device (100) merges the text block and the image block which are within the existing outline of the screenshot to form at least one data item and determines whether the at least one data item generated matches the data item threshold. In response to determining that the at least one data item generated matches the data item threshold, the electronic device (100) provides the at least one data item to the user.
In another embodiment, the electronic device (100) determines a context associated with a user communication with the user contact and automatically provides the data item based on the context. From the above example, consider that before capturing the screenshot the user was chatting with a user contact 1 regarding buying a watch of AAA brand to gift for a brother of the user contact 1. The electronic device (100) identifies the context of the chat as watch and also extracts text such as the AAA brand. When the user at screenshot 702 captures the screenshot of the watches in the shopping application, the electronic device (100) automatically generates the data item of the AAA brand watch and provides the data item in the messaging application for the user to share with the user contact 1 (e.g., at screenshot 710). Therefore, the proposed method provides minimum disruption between the user communication for capturing and sharing the screenshot of the watch, thereby enhancing user experience.
Referring to
At screenshot 804, the electronic device (100) determines each of the text blocks and the image blocks from the contents of the screenshot. At screenshot 806, the electronic device (100) identifies the objects from the image block as a vehicle. Further, at screenshot 806, the electronic device (100) determines the attributes associated with the vehicle based on the metadata of the image block such as for example the vehicle is a car, the color of the car is brown, the car belongs to brand XXX and may be a sports utility vehicle (SUV), etc. At screenshot 808, the electronic device (100) performs the text detection on the text block which is in close proximity to the image block in the screenshot and matches the attributes obtained from the image block to the contents of the text block. Further, at screenshot 810, the electronic device (100) automatically generates the data item by merging the image block consisting the car with the text block which has matching text content based on the attributes of the car.
Referring to
At screenshot 904, the electronic device (100) identifies the objects from the plurality of content in the screenshot such as the image blocks and the text blocks. At screenshot 906, the electronic device (100) analyzes the spaces between each of the image blocks and the text blocks. Further, the electronic device (100) determines the image block and the text block which are in proximity to each other i.e., the image block and text block with less space between each other and also the image block and the text block which are not in proximity to each other i.e., the image block and the text block with larger space between each other.
At screenshot 908, the electronic device (100) combines the closely spaced at least one image block and the at least one text block to form at least one data item and checks whether the data item threshold hold good for the generated at least one data item. On determining that the data item threshold hold good for the at least one data item generated by the electronic device (100), the at least one data item is presented to the user.
Referring to
At operation 2 shown in
Similarly, at operation 2 shown in
Similarly, at operation 2 shown in
Further, in case at operation 2 shown in
Referring to
Referring to
Consider the user makes a periodic payment like rent payment using a payment application 1 in the electronic device (100), as shown in 1102 of
At screenshot 1108 of
Therefore, in the methods and systems of the related art the user has to perform a plurality of steps before sharing the portion of the screenshot especially payment confirmations in case of the rent payment. Unlike to the methods and systems of the related art, in the proposed method the periodic behavior of the user is learnt continuously and based on the learning the sharable data item is automatically generated comprising only the portion of the screenshot generally shared by the user. Also, the electronic device (100) automatically suggests the user contact to which the data item needs to be shared based on the learning. Therefore, the proposed method reduces dependency on the user to perform a series of steps and enhances ease of usage.
Consider the user makes payments such as a rent payment from user bank account to another bank account suing a banking application. The rent payment is reflected in the user transaction details in the user's bank account and the user captures the screenshot of the transaction details from the banking application, as shown in screenshot 1202 of the
At screenshot 1204 of
At screenshot 1206 of
Consider a user orders pizza from a store and generates a payment invoice for the purchase of the pizza. The user then captures the screenshot of the invoice generated after the purchase in the electronic device (100), as shown in screenshot 1302.
At screenshot 1304, the electronic device (100) receives the screenshot which comprises various contents such as order details including order number, date and time of order placement; order summary; payment mode and total billing details with tax information, etc. based on the plurality of data item parameters. The electronic device (100) then generates the at least one data item based on the plurality of data item parameters.
The generation of the at least one data item does not require much user input. The at least one data item generated by the electronic device (100) is provided to the user for selection which can be selected with a tap and shared directly i.e., the order summary data item can be selected and shared by the user directly. Also, the user can select multiple data items at different locations of the screen of the electronic device (100).
In another embodiment, the electronic device (100) extracts selectable text from the screenshot which can be copied to a clipboard and shared by the user.
In another embodiment, the electronic device (100) determines a context between the at least one image block and the at least one text block and then the at least one image block and the at least one text block having the same context is combined to form the at least one data item.
While the disclosure has been shown and described with reference to various embodiments thereof, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the disclosure as described by the appended claims and their equivalents.
Number | Date | Country | Kind |
---|---|---|---|
201941036011 | Sep 2019 | IN | national |