Social media platforms typically associate avatars, user icons, profile pictures, etc. with user accounts. These images differentiate users, which can be helpful when a user is adding another user to a network of friends, browsing through a list of users, or even identifying who is posting on the user's feed. Typically, a default icon is used when a user has not uploaded or selected a profile picture to be associated with their account, and some systems may allow a user to select from pre-set stock avatars. In particular, casual users of a social media platform may be less motivated to select an image or to take a picture of their own face to upload and share publicly. Lacking an identifying profile picture, such users with default icons may be less likely to form connections with other users and more likely to decrease use of the social media platform as a result. Thus, encouraging users to adopt a profile picture can increase user engagement and retention.
A server device providing a social media platform is provided herein. In one example, the server device includes one or more processors configured to execute instructions stored in associated memory to send instructions to a client device to cause the client device to display a graphical user interface (GUI) of the social media platform, and receive at least one image of a face of a user of the client device and a selection of at least one predetermined style via the GUI. The one or more processors are configured to, using the at least one image and the at least one predetermined style as input, generate a plurality of artificial intelligence (AI) profile pictures via an AI model, and send the plurality of AI profile pictures to the client device.
This Summary is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description. This Summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used to limit the scope of the claimed subject matter. Furthermore, the claimed subject matter is not limited to implementations that solve any or all disadvantages noted in any part of this disclosure.
To address the issues described above,
Briefly, the client device 24 may send a generation request 42 to the server device 10 seeking generation of AI profile pictures. The one or more processors 14 may be configured to receive the generation request 42, including at least one image 44 of a face of a user of the client device 24 and a selection 46 of at least one predetermined style 48 (see
The trained machine learning model 52 may include a library 62 of predetermined templates of different sets of embeddings that describe different style and/or scene information, and may be configured to receive user input 64 selecting one or more styles, etc. from the library. The user input 64 and the set of embeddings 58 may be provided as input to a text encoder 66 and the text encoder 66 may generate an input feature vector 68 based at least on the user input 64 and the set of embeddings 58. This input feature vector 68 may be sent to a diffusion model 70 such as a Stable Diffusion model which is configured to generate a synthesized image 72 of the user based at least on the input feature vector 68.
Turning to
Turning to
In the depicted example, the AI profile picture generation module 18 utilizes the first AI model 52A and the second AI model 52B for two separate and different pathways. Here, the GUI 26 includes a first selector 90 (PLUS) to request generation of a first plurality of AI profile pictures (as the plurality of AI profile pictures 50) via the first AI model 52A and a second selector 92 (LITE) to request generation of a second plurality of AI profile pictures (as the plurality of AI profile pictures 50) via the second AI model 52B. The generation via the first AI model 52A is slower than the generation via the second AI model 52B, and the first plurality of AI profile pictures is greater than the second plurality of AI profile pictures. For example, the Plus pathway may use 5-10 user images 44 as input, finish generating the second plurality of AI profile pictures in about 20 minutes, and produce 4 images as output. In contrast, the Lite pathway may use a single user image 44 as input, finish generating the second plurality of AI profile pictures in less than 30 seconds, and produce 30 images as output. These are parameters that can be set in consideration of quality of images output, time found reasonable by the user base, and availability and cost of model resources, for example. The differences between the inputs and outputs of the two AI models 52A, 52B may be attributable to the first AI model 52A having a larger parameter size than the second AI model 52B. That is, the first AI model 52A may be more complex and therefore more likely to produce a desirable, high-quality outcome, but as a tradeoff, may utilize more resources and take longer to produce results. It will be appreciated that other differences between various models may be used to provide various options for the user, such as using models trained on different data sets and specializing in different types of outputs.
Once the server device 10 has begun profile picture generation with the selected model, an profile picture generation status screen 102 of the GUI 26 may be displayed on the client device 24, as shown in
Upon receiving a selection of the next selector 112, the GUI 26 may be configured to display an profile picture selection screen 114 illustrated in
In some implementations, the GUI 26 may be configured to generate media content such as a post 134 under an account of the user featuring the selected AI profile picture 118, as shown in
At 1308, the AI model may be a first AI model, and the GUI may include a first selector to request generation of a first plurality of AI profile pictures via the first AI model and a second selector to request generation of a second plurality of AI profile pictures via a second AI model. Alternatively, at 1310, the AI model may be a second AI model. The generation via the first AI model may be slower than the generation via the second AI model, and the first plurality of AI profile pictures is greater than the second plurality of AI profile pictures. This may be because the first AI model has a larger parameter size than the second AI model. Thus, at 1312, use of the first AI model may result in more profile pictures, while at 1314, use of the second AI model may result in fewer profile pictures. However, other differences between models may be utilized. By providing at least these two different pathways, users can self-select based on their own preference and level of interest whether to spend more time for better results or sacrifice quality for a quick turnaround. In some implementations, the method may further comprise, responsive to receiving a request from the user for further generation, generating an additional plurality of AI profile pictures via the AI model, and sending the additional plurality of AI profile pictures to the client device. Then, at 1316, the method 1300 may include sending the plurality of AI profile pictures to the client device.
At 1318, the method 1300 may include, upon receiving selection of one of the plurality of AI profile pictures via the GUI, designating the selected AI profile picture as a current profile picture of the user on the social media platform. This may increase the proportion of users on the social media platform with recognizable and useful user profile pictures associated with their profiles and accounts. Accordingly, at 1322, the method may include the GUI being configured to display the selected AI profile picture, and at 1324, the selected profile picture may be displayed to other users with a watermark or frame indicating that the selected AI profile picture was generated with AI. This may serve to advertise the AI profile picture generation feature to other users who may be interested. Another way to draw in more interested users is if the GUI is configured to generate a post under an account of the user featuring the selected AI profile picture.
In some embodiments, the methods and processes described herein may be tied to a computing system of one or more computing devices. In particular, such methods and processes may be implemented as a computer-application program or service, an application-programming interface (API), a library, and/or other computer-program product.
Computing system 1400 includes a logic processor 1402 volatile memory 1404, and a non-volatile storage device 1406. Computing system 1400 may optionally include a display subsystem 1408, input subsystem 1410, communication subsystem 1412, and/or other components not shown in
Logic processor 1402 includes one or more physical devices configured to execute instructions. For example, the logic processor may be configured to execute instructions that are part of one or more applications, programs, routines, libraries, objects, components, data structures, or other logical constructs. Such instructions may be implemented to perform a task, implement a data type, transform the state of one or more components, achieve a technical effect, or otherwise arrive at a desired result.
The logic processor may include one or more physical processors (hardware) configured to execute software instructions. Additionally or alternatively, the logic processor may include one or more hardware logic circuits or firmware devices configured to execute hardware-implemented logic or firmware instructions. Processors of the logic processor 1402 may be single-core or multi-core, and the instructions executed thereon may be configured for sequential, parallel, and/or distributed processing. Individual components of the logic processor optionally may be distributed among two or more separate devices, which may be remotely located and/or configured for coordinated processing. Aspects of the logic processor may be virtualized and executed by remotely accessible, networked computing devices configured in a cloud-computing configuration. In such a case, these virtualized aspects are run on different physical logic processors of various different machines, it will be understood.
Non-volatile storage device 1406 includes one or more physical devices configured to hold instructions executable by the logic processors to implement the methods and processes described herein. When such methods and processes are implemented, the state of non-volatile storage device 1406 may be transformed—e.g., to hold different data.
Non-volatile storage device 1406 may include physical devices that are removable and/or built-in. Non-volatile storage device 1406 may include optical memory (e.g., CD, DVD, HD-DVD, Blu-Ray Disc, etc.), semiconductor memory (e.g., ROM, EPROM, EEPROM, FLASH memory, etc.), and/or magnetic memory (e.g., hard-disk drive, floppy-disk drive, tape drive, MRAM, etc.), or other mass storage device technology. Non-volatile storage device 1406 may include nonvolatile, dynamic, static, read/write, read-only, sequential-access, location-addressable, file-addressable, and/or content-addressable devices. It will be appreciated that non-volatile storage device 1406 is configured to hold instructions even when power is cut to the non-volatile storage device 1406.
Volatile memory 1404 may include physical devices that include random access memory. Volatile memory 1404 is typically utilized by logic processor 1402 to temporarily store information during processing of software instructions. It will be appreciated that volatile memory 1404 typically does not continue to store instructions when power is cut to the volatile memory 1404.
Aspects of logic processor 1402, volatile memory 1404, and non-volatile storage device 1406 may be integrated together into one or more hardware-logic components. Such hardware-logic components may include field-programmable gate arrays (FPGAs), program- and application-specific integrated circuits (PASIC/ASICs), program- and application-specific standard products (PSSP/ASSPs), system-on-a-chip (SOC), and complex programmable logic devices (CPLDs), for example.
The terms “module,” “program,” and “engine” may be used to describe an aspect of computing system 1400 typically implemented in software by a processor to perform a particular function using portions of volatile memory, which function involves transformative processing that specially configures the processor to perform the function. Thus, a module, program, or engine may be instantiated via logic processor 1402 executing instructions held by non-volatile storage device 1406, using portions of volatile memory 1404. It will be understood that different modules, programs, and/or engines may be instantiated from the same application, service, code block, object, library, routine, API, function, etc. Likewise, the same module, program, and/or engine may be instantiated by different applications, services, code blocks, objects, routines, APIs, functions, etc. The terms “module,” “program,” and “engine” may encompass individual or groups of executable files, data files, libraries, drivers, scripts, database records, etc.
When included, display subsystem 1408 may be used to present a visual representation of data held by non-volatile storage device 1406. The visual representation may take the form of a graphical user interface (GUI). As the herein described methods and processes change the data held by the non-volatile storage device, and thus transform the state of the non-volatile storage device, the state of display subsystem 1408 may likewise be transformed to visually represent changes in the underlying data. Display subsystem 1408 may include one or more display devices utilizing virtually any type of technology. Such display devices may be combined with logic processor 1402, volatile memory 1404, and/or non-volatile storage device 1406 in a shared enclosure, or such display devices may be peripheral display devices.
When included, input subsystem 1410 may comprise or interface with one or more user-input devices such as a keyboard, mouse, touch screen, or game controller. In some embodiments, the input subsystem may comprise or interface with selected natural user input (NUI) componentry. Such componentry may be integrated or peripheral, and the transduction and/or processing of input actions may be handled on- or off-board. Example NUI componentry may include a microphone for speech and/or voice recognition; an infrared, color, stereoscopic, and/or depth camera for machine vision and/or gesture recognition; a head tracker, eye tracker, accelerometer, and/or gyroscope for motion detection and/or intent recognition; as well as electric-field sensing componentry for assessing brain activity; and/or any other suitable sensor.
When included, communication subsystem 1412 may be configured to communicatively couple various computing devices described herein with each other, and with other devices. Communication subsystem 1412 may include wired and/or wireless communication devices compatible with one or more different communication protocols. As non-limiting examples, the communication subsystem may be configured for communication via a wireless telephone network, or a wired or wireless local- or wide-area network, such as a HDMI over Wi-Fi connection. In some embodiments, the communication subsystem may allow computing system 1400 to send and/or receive messages to and/or from other devices via a network such as the Internet.
The following paragraphs provide additional description of the subject matter of the present disclosure. One aspect provides a server device providing a social media platform, and the server device includes one or more processors configured to execute instructions stored in associated memory. The one or more processors of the server device are configured to send instructions to a client device to cause the client device to display a graphical user interface (GUI) of the social media platform, and receive at least one image of a face of a user of the client device and a selection of at least one predetermined style via the GUI. The one or more processors are further configured to, using the at least one image and the at least one predetermined style as input, generate a plurality of artificial intelligence (AI) profile pictures via an AI model, and send the plurality of AI profile pictures to the client device. In this aspect, additionally or alternatively, the AI model can be a first AI model and the server device can further include a second AI model, the GUI can include a first selector to request generation of a first plurality of AI profile pictures via the first AI model and a second selector to request generation of a second plurality of AI profile pictures via the second AI model. In this aspect, additionally or alternatively, the generation via the first AI model can be faster than the generation via the second AI model, and the first plurality of AI profile pictures can be fewer than the second plurality of AI profile pictures. In this aspect, additionally or alternatively, the first AI model can have a larger parameter size than the second AI model. In this aspect, additionally or alternatively, the one or more processors can be further configured to, upon receiving selection of one of the plurality of AI profile pictures via the GUI, designate the selected AI profile picture as a current profile picture of the user on the social media platform. In this aspect, additionally or alternatively, the GUI can be configured to display the selected AI profile picture with a watermark or frame indicating that the selected AI profile picture was generated with AI. In this aspect, additionally or alternatively, the GUI can be configured to generate a post under an account of the user featuring the selected AI profile picture. In this aspect, additionally or alternatively, the GUI can be configured to display a plurality of example images representing a plurality of predetermined styles, and selection of at least one of the example images by the user can constitute the selection of the at least one predetermined style. In this aspect, additionally or alternatively, the one or more processors can be configured to, responsive to receiving a request from the user for further generation: generate an additional plurality of AI profile pictures via the AI model, and send the additional plurality of AI profile pictures to the client device. In this aspect, additionally or alternatively, the GUI can be configured to allow the user to request the further generation up to a predetermined number of times.
Another aspect provides a method for a social media platform, comprising sending instructions to a client device to cause the client device to display a graphical user interface (GUI) of the social media platform, and receiving at least one image of a face of a user of the client device and a selection of at least one predetermined style via the GUI. The method further comprises using the at least one image and the at least one predetermined style as input, generating a plurality of artificial intelligence (AI) profile pictures via an AI model, and sending the plurality of AI profile pictures to the client device. In this aspect, additionally or alternatively, the AI model can be a first AI model, and the GUI can includes a first selector to request generation of a first plurality of AI profile pictures via the first AI model and a second selector to request generation of a second plurality of AI profile pictures via a second AI model. In this aspect, additionally or alternatively, the generation via the first AI model can be faster than the generation via the second AI model, and the first plurality of AI profile pictures can be fewer than the second plurality of AI profile pictures. In this aspect, additionally or alternatively, the first AI model can have a larger parameter size than the second AI model. In this aspect, additionally or alternatively, the method may further comprise, upon receiving selection of one of the plurality of AI profile pictures via the GUI, designating the selected AI profile picture as a current profile picture of the user on the social media platform. In this aspect, additionally or alternatively, the GUI can be configured to display the selected AI profile picture with a watermark or frame indicating that the selected AI profile picture was generated with AI. In this aspect, additionally or alternatively, the GUI can be configured to generate a post under an account of the user featuring the selected AI profile picture. In this aspect, additionally or alternatively, the GUI can be configured to display a plurality of example images representing a plurality of predetermined styles, and selection of at least one of the example images by the user constitutes the selection of the at least one predetermined style. In this aspect, additionally or alternatively, the method may further comprise, responsive to receiving a request from the user for further generation, generating an additional plurality of AI profile pictures via the AI model, and sending the additional plurality of AI profile pictures to the client device.
Another aspect provides a client device for accessing a social media platform. The client device comprises one or more processors configured to execute instructions stored in associated memory to display a graphical user interface (GUI) of the social media platform, send at least one image of a face of a user of the client device and a selection of at least one predetermined style via the GUI to a server device including an artificial intelligence (AI) model, and receive, from the server device, a plurality of AI profile pictures generated via the AI model using the at least one image and the at least one predetermined style as input.
Another aspect provides a method for generating an AI profile picture of a social media platform user. The method comprises displaying a graphical user interface (GUI) of the social media platform, in response to an instruction of editing a profile picture of a user, displaying an entrance for generating the AI profile picture, and entering a procedure of generating the AI profile picture. The procedure comprises receiving a first input of at least one image of a face of the user and a second input of at least one style, and generating at least one AI profile picture via an artificial intelligence (AI) model based on the at least one image and the at least one predetermined style. The method further comprises receiving a selection of the AI profile picture from the at least one AI profile picture, and replacing a profile picture of the user to be the selected AI profile picture. In this aspect, additionally or alternatively, the method may further comprise, in response to the replacement, generating media content according to the selected AI profile picture. In this aspect, additionally or alternatively, the method may further comprise, after generating the at least one AI profile picture, generating media content based on the least one AI profile picture. In this aspect, additionally or alternatively, the media content may include a link to generate an AI profile picture for another user.
It will be understood that the configurations and/or approaches described herein are exemplary in nature, and that these specific embodiments or examples are not to be considered in a limiting sense, because numerous variations are possible. The specific routines or methods described herein may represent one or more of any number of processing strategies. As such, various acts illustrated and/or described may be performed in the sequence illustrated and/or described, in other sequences, in parallel, or omitted. Likewise, the order of the above-described processes may be changed.
The subject matter of the present disclosure includes all novel and non-obvious combinations and sub-combinations of the various processes, systems and configurations, and other features, functions, acts, and/or properties disclosed herein, as well as any and all equivalents thereof.
An appendix is attached further describing embodiments of the present disclosure.
This application claims priority to U.S. Provisional Patent App. No. 63/498,268, filed Apr. 25, 2023, the entirety of which is hereby incorporated herein by reference for all purposes.
Number | Date | Country | |
---|---|---|---|
63498268 | Apr 2023 | US |