This invention, the Somatic Artificial Intelligence Data System (SAIDS), relates generally to privacy protection, data monetization, and user-controlled data systems using artificial intelligence (AI) data curation.
The explosion of data collection and its potential applications has driven numerous innovations in privacy protection, data monetization, and user-controlled data systems. Various prior art systems and methods provide context for the novel aspects of the Somatic Artificial Intelligence Data Systems (SAIDS) invention.
1. Digital Consolidation Systems (US_20230237349_A1): This invention addresses centralized data management by consolidating disparate data sources into a unified system. It emphasizes ease of data aggregation and improved accessibility for end-users. However, it lacks mechanisms for real-time user-controlled anonymization and monetization of data assets. Unlike SAIDS, it does not focus on opt-in/opt-out economic models or privacy-centric AI-driven cross-checks to prevent unauthorized data usage.
2. Smart Prompts and AI-Based Digital Representation (US_20220014352_A1): This system employs AI for generating personalized digital interactions and encrypted communication methods. While it integrates AI, its focus is on enhancing user engagement and steganographic encryption, rather than offering a holistic framework for data privacy, user control, and monetization as seen in SAIDS. The SAIDS framework surpasses this by correlating multimodal data and enabling user-driven economic participation.
3. Data Futures Platform (US_20220188925_A1): This patent highlights a platform for trading data as a commodity, emphasizing futures contracts for data assets. Although it presents an economic model for data use, it is not equipped to handle real-time data curation, privacy protection, or multimodal data streams as comprehensively as SAIDS. SAIDS uniquely integrates privacy-preserving generative AI agents and somatic-environmental data correlation.
4. Anonosizing Data with AI (U.S. Pat. No. 10,043,035 B2): This invention introduces systems for enhancing data protection by anonymizing structured and unstructured data using dynamically changing identifiers (DDIDs). While it offers strong privacy controls, it does not address the proactive detection of unauthorized data usage or enable economic models for user monetization of data assets, which are critical elements of SAIDS.
5. Revenue Collection from Personal Data Assets (US_20110313915_A1): This system allows users to monetize their personal data assets through a centralized platform, sharing revenue with users based on data usage. However, it lacks integration with AI-driven privacy mechanisms or somatic-environmental data fusion for advanced applications. SAIDS builds upon this concept by offering a more secure, privacy-focused approach that enables real-time control and multimodal data transformation.
6. User Data Management Platform (U.S. Pat. No. 12,100,025 B2): This platform enables user-controlled data sharing and storage but does not integrate AI-driven anonymization or economic frameworks for data monetization. Additionally, it does not consider somatic data or multimodal inputs in its data management approach. SAIDS distinguishes itself by offering a real-time, privacy-centric, user-first economic model.
The SAIDS invention addresses gaps in the prior art by integrating AI-driven data curation, privacy-preserving transformations, and an economic framework for user-controlled data monetization. It uniquely combines somatic and environmental data for AI applications while ensuring users retain control over their data. These features position SAIDS as a significant advancement in data privacy and economic empowerment for individuals.
Data in all forms is becoming the currency of modern economies, including massive datasets employed in the training of Artificial Intelligence (AI) systems. We are running out of data for AI systems and other endeavors. The present invention, Somatic Artificial Intelligence Data Systems (“SAIDS”), is a method and means of personal data curation and transfer, wherein large data collection is enabled and the user as collector retains control of their data. A method and means for a Generative Artificial Intelligence Agent (“GAIA”) to curate a collection of data is enabled whereby the user maintains privacy and active control of which data is stored and potentially transferred. Data transference will comprise the user either opting-in or opting-out of transfer of personal data to other datasets, possibly AI training datasets, public databases or any other entity with data needs. An economic model is enabled whereby the user receives economic benefits from sharing of this data. You may get paid for your data. The alternative opting-out methods and means provide for retention of personal data with additional methods and means of privacy confirmation through AI comparisons to these other, nonpersonal datasets. The pilfering of personal data for AI training and other unauthorized uses is detected and can be dealt with according to data propriety laws. Copyright, trademark and other Intellectual Property (IP) violations may be rectified. The user truly owns their data.
A clear understanding of the key features of the invention summarized above may be had by reference to the appended figures, which illustrate the method and system of the invention, although it will be understood that such figures depict preferred embodiments of the invention and, therefore, are not to be considered as limiting its scope with regard to other embodiments which the invention is capable of contemplating.
50 User comprehensive Data sources
52 Generalized Artificial Intelligence Agent (GAIA)
54 Iterative process
56 Curated data flow/transmission
58 Data Transfer Decision Branch
60 Opting-Out Decision data flow
62 Opting-In Decision data flow
64 Users opting-out of sharing data
66 Users opting-in to sharing data
68 Curated Data opted-in and shared for Training and Other Uses
70 Economic benefits of data sharing through opting-in (e.g. money)
72 Any possible data sources via activities/environment/somatic data
74 Data input device or sensor (e.g. smartphone camera, microphone, etc.)
76 Pilfering Detection/Data sovereignty algorithms
78 Data destination Options when Opting-In
80 Potential SAIDS data purchasers/licensees
82 Sensor for Parallel Data Collection
84 Parallel Data Collection of text or other written & visual information
86 Parallel Data Collection of keyboard input data via visual and/or audio sounds
88 Parallel Data Collection of visual information using lipreading by GAIA
90 Comprehensive Data sources including any legacy, social media or
92 Somatic data sources
94 Somatic data after GAIA curation and correlation
96 User activity with GAIA somatic & correlated data collection
98 Resultant Emulation/Robotic Mimicry thru SAIDS/GAIA methods and means
100 User original raw positional data
102 GAIA transformed positional data
104 User original raw visual appearance data
106 GAIA transformed visual appearance data (anonymization for privacy)
108 Proprietary image original raw visual data
110 Proprietary image GAIA transformed visual data (still and/or video)
112 User original raw voice/audio data
114 GAIA transformed voice/audio data
116 Complex Environment with multiple people and objects
118 User original raw voice/audio data with multiple audio signals
120 GAIA transformed voice/audio data with user trained & selected audio signals transformed or zeroed out
124 User original raw conversational text content
126 GAIA transformed conversational text content
128 GAIA transformed Complex Environment with blanking of visual data to privatize and anonymize people and other objects user has trained GAIA to transform
130 GAIA data blackouts/no recording when in trained private times and locations
132 GAIA Data Curation by Command: User command
134 GAIA Data Curation by Command: GAIA acknowledgement
136 Initial AI Agent Curation Training with Iteration and Feedback through Personal Data Collected and Access given to GAIA
138 SAIDS Embodiments linked to Cellphone/Smartphone
140 SAIDS: Standalone Embodiments without Cellphone
142 SAIDS: Watchlike Embodiments in Use, allowing user data collection first source or in parallel to other data collection
144 Artificial Intelligence as a Service (AlaaS) cloud-based service for GAIA selected outsourcing
146 Local/personal storage & compute resources with user data collection
148 GAIA Hardware
150 GAIA computer-readable medium or software code means
152 GAIA possible Transformer (i.e. GPT) model architecture
The terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. As used herein, the term “and/or” includes any and all combinations of one or more of the associated listed items. As used herein, the singular forms “a,” “an,” and “the” are intended to include the plural forms as well as the singular forms, unless the context clearly indicates otherwise. It will be further understood that the terms “comprises” and/or “comprising,” when used in this specification, specify the presence of stated features, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, steps, operations, elements, components, and/or groups thereof.
Unless otherwise defined, all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one having ordinary skill in the art to which this invention belongs. It will be further understood that terms, such as those defined in commonly used dictionaries, should be interpreted as having a meaning that is consistent with their meaning in the context of the relevant art and the present disclosure and will not be interpreted in an idealized or overly formal sense unless expressly so defined herein.
In describing the invention, it will be understood that a number of techniques and steps are disclosed. Each of these has individual benefit and each can also be used in conjunction with one or more, or in some cases all, of the other disclosed techniques.
Accordingly, for the sake of clarity, this description will refrain from repeating every possible combination of the individual steps in an unnecessary fashion. Nevertheless, the specification and claims should be read with the understanding that such combinations are entirely within the scope of the invention and the claims.
In the following description, for purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the present invention. It will be evident, however, to one skilled in the art that the present invention may be practiced without these specific details.
The present disclosure is to be considered as an exemplification of the invention and is not intended to limit the invention to the specific embodiments illustrated by the figures or description below.
The inventor claims a method and means of data collection, curation, and transference. The “Somatic Artificial Intelligence (AI) Data Systems” (“SAIDS”) approach is a comprehensive, unobvious and novel system which enables the user (“owner”) to collect a multitude of data, some personal, with benefit of a Generative Artificial Intelligence Agent (“GAIA”) to curate this data as needed. An economic model of “opting-in” or “opting-out” of data transference to other repositories extends the invention's claims. Data ownership and possible sale methods are envisioned.
On 14 Nov. 2023, the inventor John David Fitzsimmons Jr was reading news articles and noted that Artificial Intelligence (AI) systems such as ChatGPT, GPT-4, Bard, Gemini, etc. are running out of data. Not only this limitation, but also data in its totality is oftentimes vacuumed up (i.e. “mined” or “crawled”) from the internet and other sources without the owners' permission. It is a broken system violating the rights of many.
Almost everyone has a plethora of data sources available to them personally. A cellphone. A personal computer/laptop, FitBit. Garmin Fitness tracker, DropBox, old hard disk drive (HDD), etc. The SAIDS could be comprehensive in collecting these data sources and others, either directly from a device or in parallel. An “AI Agent” would collect and curate the user's data as well as facilitate uploading said data to the “Cloud”, other select repositories as well as databases for AI model training. A fundamental condition of the invention is the user/owner retains the rights to her data. If that user “opts-in,” then she is paid for the use of her SAIDS data. If she “opts-out,” then the data is not uploaded to the Cloud or other repositories and methods are envisioned whereby data crosschecking may occur. Data pilfering or data acquired without consent could be detected.
1) Personal (“Somatic”) Data Collection 50 and iteration 54,
2) Generative Artificial Intelligence Agent (“GAIA”) 52 Curation with data flow 56, and
3) Transfer Opt-In/Opt-Out 58 decisions to appropriate destinations: Opting-Out Decision data flow 60 repositories & Opting-In Decision data flow 62 repositories.
The collection of data follows a logical flow 56 with this novel AI Agent “GAIA” 52 curating the data. The details of this curation will be comprehensively explained in the specification that follows. The subsequent element of a “Transfer Decision” 58 is the “opt-in” 62 or “opt-out” 60 choice for the user. The methods and means of the invention based on this decision will be explained as well in the specification that follows.
Big data needs are potentially fulfilled when many users employ this invention as seen in the figure. This may be a case of highest bidder winning the rights to your data. Subscribing fees are one possibility to compensate the user for the resources of data. People are resigned to paying subscriber fees for many data services such as cable television, NetFlix, Disney+, HBOMax, etc., but now they are able to be paid by subscribers to their data. A nice way to democratize data in the digital AI age.
As an economic model, SAIDS encompasses anything within the purview of a personal data source. If it can be collected in electronic form, it can be curated and passed to the transfer decision of opt-in/opt-out. The figure lists “somatic” data which can be from the body or around the user's body. Examples include but are not limited to: a person's location (e.g. latitude, longitude, altitude), visual data collected from a video source, aural or audio data collection such as ambient noises/speech/environmental sounds, etc., haptic/sensation data from pressure transducers or similar means, body state such as a person's blood pressure, heartrate, and temperature, video screens/monitors, keyboard interactions and any other environmental stimuli.
The AI Agent 52 shown in
Investigating the prior art will not require much digging into the past because the means to implement an AI Agent as envisioned was simply not in existence a very short time ago. Imagine looking for prior art of an electronic computing machine in the Middle Ages—The technology simply did not exist.
Continuing with
Multimodality offers synergy here, with the term multimodality referring to a combination of multiple sensory and communicative methods, such as sound, video, text, print, images, music, sensory/haptic, and so on, that produce data, analytic or training potential. From a biomedical perspective, multimodality could correlate the body data parameter of heartrate to environmental noise levels (See
Given the data collected and curated by the AI Agent, the transfer decision in this case is opting-out 60, not sharing the data but retaining it if selected and running 76 this personal retained data (“GAIA AI Agent Crosscheck”) against available public datasets. As shown, cloud data is compared to personal data with algorithms 76 comprising checksums (i.e. condensed data derived from another block of digital data for the purpose of detecting errors & authenticity), full-factorial token comparisons as well as other algorithms for checking data integrity. A primitive manual analogue is typing your name or Social Security Number (SSN) into Google search and reviewing the hits/results. Any hits of the exact SSN number should not be in the public domain. Similarly, the GAIA AI Agent automatically and continually scans public data to ascertain if your personal data has been pilfered. An example would be an artist having a personal painting in the form of a .png file that could be searched for in public databases (“Cloud”) by the AI Agent. Checksums are one algorithm that the AI Agent could use for pattern recognition against other files. Upon confirmation of appropriation without permission, the user may exercise her rights as needed.
According to vocabulary.com, “Somatic is a fancy word that just means dealing with the body.” In this document, somatic applies to both:
1) Body data such as a person's heart rate, blood pressure, temperature, location, etc. AS WELLAS,
2) Somatically worn (i.e. “body-worn”) apparatus for collection of this data and other environmental data such as video and audio.
Ideally, device compatibility would allow SAIDS to access data directly from other electronic devices such as a FitBit, smart-/cellphone, laptop computer, etc. Unfortunately, some of these devices are restrictive in their data formats and access. SAIDS video and audio sensors would obviate these data roadblocks by intelligently analyzing feeds and distilling data. In the movie “2001: A Space Odyssey,” there is a scene where the HAL 9000 computer is denied access to the audio from the talking astronauts in the pod, but HAL can read their lips. Same principle here—SAIDS employing GAIA would view/listen/access information to something and discern data content through means of artificial intelligence. Speech to text. Type to text. View a monitor and run a means of optical character recognition (OCR) to generate the data that is endemic to that device. Audio AI keyboard “attacks” are becoming commonplace—an AI system can discern what you are typing with only the sound of you clacking away. SAIDS would do the same if not able to access the direct keyboard data or view the keyboard to convert to text. Many heretofore unexplored possibilities open up with an AI Agent coupled to multimodal data. Imagine how a dog can sense an earthquake before it happens by the lower frequencies it hears. Imagine SAIDS with GAIA applying powerful AI methods while viewing and listening at a car dealership and whispering in a person's earbud that the salesman is being deceptive and a better deal can be had. Profound possibilities open up.
How does a carpenter replace a door? A chef prepare a delicious entrée? A painter produce a masterpiece? These high-level, advanced mental & physical behaviors require large datasets to train AI systems coupled to means of interaction with the physical world (e.g. robots)—cutting edge AI will fuse machine intelligence with physical capability in the real world (AI brains get brawn). SOMATIC AI (SAIDS) IS THE ANSWER TO TRAINING MACHINE INTELLIGENCE manifesting in useful ways when paired with physical capabilities, producing useful roles in our lives and economy. Sending an EOD robot to defuse an explosive device without putting a person at risk is a current example but these methods and means will extend capabilities far past the current state of the art.
Data curation includes “all the processes needed for principled and controlled data creation, maintenance, and management, together with the capacity to add value to data,” from the online Wikipedia. Also, “Data curation is the process of creating, organizing and maintaining data sets so they can be accessed and used by people looking for information. It involves collecting, structuring, indexing and cataloging data for users in an organization, group or the general public. Data can be curated to support business decision-making, academic needs, scientific research and other purposes,” from the website https://www.techtarget.com/searchbusinessanalytics/definition/data-curation. The term is broad and inclusive of many processes that are designed to organize and give meaning to data throughout its lifecycle. The GAIA AI Agent methods and means of this invention will incorporate a plethora of these processes in an advanced, automatic manner after training and fine-tuning.
In discussing how the GAIA AI Agent 52 initially fits into the SAIDS process during data creation—The AI Agent will be involved with the transference & data crosschecking in the opting-in or opting out processes as well. The GAIA AI Agent was depicted as a second step in the methods and means of the SAIDS invention but it is integrally involved in all steps of data curation from data collection/creation to transference/disposition to data sovereignty checks.
Specifically, somatic transformation and anonymizing occurs when collected data is accessed by the GAIA AI AGENT and it alters/filters data to preserve data training efficacy while preserving relevant persons' and entities' privacy to the degree specified by the user. A specific example includes transforming any data stream that can reveal identity: names may be transformed from textual streams, audio with speech will transform peoples' names into either pronouns or a pseudonym something like “Jane” or “Person A” based on the fine-tuning the user performed with the GAIA AI Agent. A video will have facial and other physical characteristics transformed to not reveal identities as well. These are very recent technological advancements with AI that the GAIA AI Agent will employ to curate SAIDS data for obfuscation, anonymity and privacy matters. More examples of AI Agent curation are:
Given bad actors wanting to scam people, GAIA proactively allows for obfuscation and anonymization pf voices captured and processed. Original voices 112 are altered in the GAIA curation into transformed voice/audio data 114, not susceptible to exploitation. As in visual data processing, multiple methods are available to change audio such as frequency modulation (e.g. add or subtract an octave), amplitude modulation and random noise addition to the signal. The user may specify to GAIA.
In reality, the man is asking the woman, “Did you hear what Ted said to Mona today?” 124 and that data stream is passed to GAIA that has been trained & fine-tuned to alter the speech to “Did you hear what he said to her today?” 126 The real names have been changed in the audio data, textual data, video data and any other associated data. Similar to “deep fakes”, the technology exists for this transformation of speech to be nearly seamless—the original speaker's voice can be patterned and replicated to pronounce the anonymized pronouns instead of the real names. Concomitantly, the speaker's voice can be masked and altered in waveform/frequencies if the user has trained GAIA to do so.
Initial training methods and means may be performed at the device manufacturer. Examples of this training comprise such tasks as speech recognition, Optical Character Recognition (OCR), keyboard typing video capture & conversion to ASCII text data, location transformations, encryption, and somatic data stream correlations. Different users will likely have similar needs for basic GAIA functions and this functionality can be trained into the GPT/neural network/algorithms methods and means of SAIDS prior to the user taking ownership. Subsequent to this, each person will train & fine-tune their GAIA AI Agent 136 so the AI Agent will curate personal data properly for filtering/transforming/anonymizing/data labelling/etc. to facilitate the Opt-In Process, if chosen or the Opt-Out Process and concomitant cross-checking of data integrity. One simple example is teaching the AI Agent which places may be off limits for data collection—say bedrooms and bathrooms that will be unique to that user. Also, individual speech recognition will likely need fine-tuning after the user begins using SAIDS.
This specification has given detailed explanation of the processes of SAIDS, now more consideration will be given to the apparatuses that enable the invention.
At the most basic level, a person's cellphone (i.e. “smartphone”) alone with adequate compute and software could fulfill many of the methods and means of the present invention. Said cellphone typically has a video camera and microphone (“mike”) to collect data streams. A scaled-down GAIA could be implemented via this device. To further extend SAIDS, a cellphone cradle/harness mounting the cell to the user's torso is another ramification whereby the cameras face forward and replicate a person's visual field of view. Other Embodiments adding a secondary device linked (e.g. Bluetooth connectivity) to a cellphone comprise an earbud+camera+mike device, a camera+mike pod wearable device, a chest/lapel device, necklace device, watch, glasses and a hat-mounted device. Any location affixed to, on or around the body could be used within the embodiments envisioned.
Illuminating the methods and means of building this invention, machine Learning and artificial intelligence (AI) have been around for quite a few decades but have made radical leaps in capabilities in the last years, especially since late 2022 with the release of OPENAI's ChatGPT. Key changes to the structure of Feed Forward Neural Nets with self-attention and Generative Pretrained Transformers (GPTs) have revolutionized the AI landscape. The hardware, with varied processing units like CPU/TPU/GPU/NPU/etc., and software implemented via trained neural networks/GPT's/AI models is evolving quickly. SAIDS will leverage off the latest hardware and software to bring about novel methods & means of data collection, curation and transference. Users will see personal, wearable devices that have compute resources exceeding yesteryear's mainframes and even supercomputers.
More is elucidated in two different phases of AI development: 1) AI training and 2) AI Inference. I asked ChatGPT to explain the two different phases by comparing and contrasting their objectives, goals and resources needed:
“Both phases serve distinct purposes in the development and application of AI models.
Objective: The primary goal of the training phase is to teach an AI model to recognize patterns and make predictions based on the input data.
Goal: To optimize the model's parameters by exposing it to large volumes of labeled or unlabeled data, enabling it to learn patterns, correlations, and features.
Resources Needed: The training phase requires substantial computational resources, including high-performance GPUs or specialized hardware like TPUs (Tensor Processing Units). Additionally, large datasets are crucial for training high-performing AI models.
Data Preparation: Curating, cleaning, and formatting data for the model's training. This involves labeling data if necessary and splitting it into training, validation, and test sets.
Model Training: Using algorithms like deep learning, the model learns from the data by adjusting its parameters through iterations. This process involves forward and backward passes (forward propagation and backpropagation) to minimize the difference between predicted and actual outputs (the loss function).
Optimization: Fine-tuning the model's hyperparameters (e.g., learning rate, batch size) and architecture to achieve better performance.
Validation: Assessing the model's performance on validation datasets to prevent overfitting and ensure generalizability.
Objective: The main objective during inference is to apply the trained model to new, unseen data and generate predictions or perform tasks based on this data.
Goal: To make accurate predictions, classifications, or perform tasks using the trained model with real-time or batch inputs.
Resources Needed: Inference typically requires fewer resources compared to training. It primarily involves running the trained model on available hardware, such as CPUs, GPUs, or even edge devices.
Input Processing: Receiving new data or queries that require AI-based processing or predictions.
Model Execution: Applying the previously trained model to the new data to make predictions, classifications, or generate outputs.
Output Generation: Producing the desired output based on the model's predictions or task performed.
Response Time: Inference often needs to be fast, especially in real-time applications, making latency an essential consideration.
Objective: Training aims to teach the model patterns from data, while inference applies this learned knowledge to new, unseen data.
Resource Needs: Training demands substantial computational resources and large datasets, while inference requires relatively fewer resources for executing the trained model.
Process: Training involves data preparation, model training, and optimization, whereas inference focuses on input processing, model execution, and generating outputs.
Time: Training is a time-consuming process, often taking hours, days, or even weeks, while inference is typically faster and can be performed in real-time.
In summary, the training phase builds the AI model's knowledge, while the inference phase applies that knowledge to make predictions or perform tasks on new data, with different resource requirements and goals for each phase.”
This lengthy discourse by ChatGPT shows that the SAIDS device/means/methods may be AI trained prior to user acquisition (i.e. “at the manufacturer”) and fine-tuned by the user. The inference methods will occur in real or near real-time during data collection, curation, transference and cross-checking. It's important to remember that much of the SAIDS “software” is the trained neural network or equivalent AI model construct when considering “how to build it.”
1) Artificial Intelligence as a Service (AlaaS) is a cloud-based service offering artificial intelligence (AI) outsourcing 144;
2) Local/personal storage & compute resources 146;
3) Some hybrid or combination of 1 & 2.
The methods and means of the SAIDS invention comprise any compute resources whether local, cloud-based or other. During user data collection, it is possible to share or parallel process GAIA tasks such as curation. Say a person was wearing a SAIDS device to collect data during a walk downtown. This data might be dense with people walking past on a sidewalk and GAIA may need to anonymize these faces and voices while anonymizing location data in real-time. If a non-local data network is available, the audio channels may be passed to AlaaS to parallelize these data curation tasks. Encryption and privacy would be maintained concurrently.
“
The text reply from another query to ChatGPT further refining computer resources involved in SAIDS:
“Running an AI model like me typically requires substantial computational power and resources, depending on the specific model's size and complexity. Here are the key hardware components typically needed to run AI models:
Central Processing Unit (CPU): CPUs are essential for general-purpose computing tasks. For AI models, having a multicore processor with high clock speeds can help handle parallel computations.
Graphics Processing Unit (GPU): GPUs, especially those designed for parallel processing, are crucial for accelerating the training and inference of deep learning models. NVIDIA's CUDA-enabled GPUs are popular choices due to their optimized architecture for AI workloads.
Memory (RAM): Having ample RAM is crucial for handling large datasets and model parameters efficiently. Deep learning models often require significant memory, so having a high-capacity RAM module is beneficial.
Storage: AI models generate and handle vast amounts of data. Solid State Drives (SSDs) are preferable over Hard Disk Drives (HDDs) due to faster read/write speeds, which can improve model loading times and overall performance.
Specialized AI Processors: Some AI models benefit from specialized hardware, such as TPUs (Tensor Processing Units) developed by Google or other AI-specific accelerators. These processors are designed specifically for deep learning tasks and can significantly speed up computation.
Cluster or Cloud Computing Resources: For larger models or extensive computations, utilizing clusters of machines or cloud-based services (e.g., AWS, Google Cloud Platform, Azure) can provide the necessary scalability and resources on-demand.
The hardware requirements vary based on the size and complexity of the AI model and the intended use—whether it's for training or inference tasks. More complex models with a higher number of parameters typically demand more powerful hardware for efficient execution.
It's essential to consider the specific requirements of the AI model and its intended application when determining the hardware setup needed for optimal performance.”—ChatGPT, 7 Dec. 2023
It's not within the purview of the present specification to delve into the atomics of how to build hardware that can realize the invention. A typical electronic machine with connectivity that might actualize SAIDS has standard compute resources for an electronic computer: storage, means of computation (CPU, GPU, TPU, NPU, . . . ), networking means and methods/means of display and input/output coupled with appropriate sensors, cameras, microphones, etc.
Interestingly, it turns out that current smartphones may be adequate as a minimum viable product to enable the methods and means of the SAIDS invention:
“Large language models can be squeezed onto your phone—rather than needing 1000s of servers to run—after breakthrough. Running massive AI models locally on smartphones or laptops may be possible after a new compression algorithm trims down their size. . . . What we propose in this paper is one technique that is used in combination with techniques proposed in prior works. And I think this combination will enable us to use LLMs on mobile devices more efficiently and get more accurate results.” (LiveScience online by Keumars Afifi-Sabet, Dec. 5, 2024)
The invention here envisioned will not require magic but standard electronic machines that are ubiquitous.
After initial training & fine-tuning, use of the invention will be close to effortless. Charging of the means of the invention are akin to charging of a person's cellphone or other electronic device. Plug it in or drop it into its cradle. The user may choose to collect data by wearing the means of the invention. By example, charging the device could be trained to mean sleep mode with no data collection. These diverse collect conditions are trainable by the user, with manual/speech command override as desired.
Big picture summary: the SAIDS GAIA AI agent will collect, curate and transfer data to specified repositories for the opted-in user. The user may receive funds similar to subscriber fees. The opted-out user will have data integrity crosschecks automated as well. Copyright, privacy and other Intellectual Property (IP) violations may be detected and dealt with according to law.
Closing with how the present invention is patentable. How is SAIDS different than other methods/means such as users creating & uploading videos to YouTube?
One major difference is integrated AI Agent data curation. The personal GAIA AI Agent acts autonomously after training and automates the high-level personalized process of data collection, curation and transfer when OPTING-IN.
Limitations of the prior art such as YouTube Video Editor require laborious user training and manual involvement to primitively “curate” a video prior to upload.
Somatic data collection and correlation to other data sources are novel and unobvious. A person's location, temperature, BP, etc. is merged with video, audio and other data collection. There are massive possibilities for biosciences and medical research.
The novel economic model of Opt-In or Opt-Out: make money/sell data or data integrity crosschecks to prevent pilfering as envisioned are not available outside of this invention.
This application is entitled to the benefit of Provisional Patent Application Ser. No. 63/616,528 filed Dec. 30, 2023.
Number | Date | Country | |
---|---|---|---|
63616528 | Dec 2023 | US |