Artificial Intelligence in ChatGPT and Use Cases
How much Artificial Intelligence is in ChatGPT and what are the Use Cases?
Classification
How do we classify ChatGPT?
ChatGPT
ChatGPT generates human-like text in natural language and can answer questions, handle queries, and hold conversations. It uses machine learning and neural networks to understand text and to generate contextual responses based on the input. ChatGPT belongs to the category of generative artificial intelligence (AI) - a powerful and transformative technology that can advance many industries, from manufacturing to retail and from finance to medicine.
Generative AI
Generative AI aims to produce realistic text, program code, designs, images, and even videos that are similar, but not identical, to the material used to train the AI. To this end, patterns and structures of large data sets are learned and reproduced. Generative AI is particularly useful for automating tasks that require natural-sounding content generation. The technology is commonly used in applications such as text generation, image synthesis, and music composition.
Large Language Models (LLMs)
Large Language Models in particular are specialized in language processing and are a form of generative AI. They are based on neural networks that learn and represent the probabilities of word or syllable sequences (tokens). This allows them to provide human-like responses to natural language queries. Because LLMs are trained on huge amounts of data, the generated texts read amazingly naturally. This opens up many new use cases and has made LLM-based apps such as ChatGPT and Google Bard fast-growing and very popular.
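To make the idea of token probabilities concrete, here is a toy sketch in plain Python: a hand-made probability table stands in for the neural network, and the next token is sampled according to its probability. Everything in it is illustrative.

```python
import random

# Toy "language model": for a given context, the probabilities of possible
# next tokens. A real LLM computes such a distribution with a neural network
# over a vocabulary of tens of thousands of tokens.
next_token_probs = {
    ("the", "soup"): {"spoon": 0.55, "is": 0.25, "ladle": 0.15, "cloud": 0.05},
}

def sample_next_token(context):
    """Pick the next token at random, weighted by its learned probability."""
    probs = next_token_probs[context]
    tokens, weights = zip(*probs.items())
    return random.choices(tokens, weights=weights, k=1)[0]

print(sample_next_token(("the", "soup")))  # usually "spoon", sometimes "is" or "ladle"
```

Repeating this step token by token is, very roughly, how an LLM builds up a whole answer.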
Function
How does it work?
What can ChatGPT & Co. do thanks to LLMs? At the core, it is always the same thing: a natural language input (optionally spoken) passes instructions to the LLM. The model tries to recognize the intention behind the transmitted tokens (the basic units into which text is broken down for machine processing) and generates natural text in return. This happens on two levels and is additionally influenced by further parameters (a minimal code sketch follows after the list):
- At a higher level, called the meta level, the LLM receives instructions about how to proceed in general, for example: «Respond to customer inquiries. Avoid offensive language, do not quote prices, do not make binding commitments.» The meta level is defined by the so-called meta prompt, which is usually set by the platform on which the AI is made available to users.
- With the so-called prompt, the LLM receives the input for the specific task. Example: «Hi! I am not satisfied with the product you delivered last week.» The prompt is created either directly from the user's input or from the user's input subsequently enriched with information by program code.
- Additional parameters determine how precise or formal the generated text is. The range goes from precise/formal to creative/informal and influences how strictly the probabilities of successive tokens are followed (in many APIs this is exposed as a setting such as temperature).
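A minimal sketch of how these three levels map onto an API call, assuming the OpenAI Python client (version 1.x); the model name is just an example, and other providers expose equivalent parameters:

```python
from openai import OpenAI

client = OpenAI()  # reads the API key from the OPENAI_API_KEY environment variable

response = client.chat.completions.create(
    model="gpt-3.5-turbo",  # example model name
    messages=[
        # Meta level: general instructions, set via the meta prompt (system role)
        {"role": "system",
         "content": "Respond to customer inquiries. Avoid offensive language, "
                    "do not quote prices, do not make binding commitments."},
        # Prompt: the specific input of the user
        {"role": "user",
         "content": "Hi! I am not satisfied with the product you delivered last week."},
    ],
    temperature=0.2,  # low: precise/formal; higher: more creative/informal
)

print(response.choices[0].message.content)
```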
Examples
Let's look at some concrete examples.
During a session with ChatGPT, instructions already given and text already generated are preserved. The user can continue the conversation without having to repeat the context and instructions each time. To return to the example of the complaining customer above: ChatGPT recognizes the input «Ich möchte mein Geld zurück!» («I want my money back!») and responds with an appropriate answer.
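Under the hood, this «memory» is nothing more than the conversation history that is sent along with every new request. A minimal sketch, assuming the same OpenAI client as above; the assistant turn shown in the history is made up for illustration:

```python
from openai import OpenAI

client = OpenAI()

# The full history (meta prompt, previous turns, new input) is sent with every
# request, so the model keeps the context without any server-side memory.
messages = [
    {"role": "system", "content": "Respond to customer inquiries. Avoid offensive language, "
                                  "do not quote prices, do not make binding commitments."},
    {"role": "user", "content": "Hi! I am not satisfied with the product you delivered last week."},
    {"role": "assistant", "content": "I am sorry to hear that. Could you tell me more about the problem?"},
    {"role": "user", "content": "Ich möchte mein Geld zurück!"},  # follow-up, no context repeated
]

response = client.chat.completions.create(model="gpt-3.5-turbo", messages=messages)
print(response.choices[0].message.content)
```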
Another example is shown here:
Prompt (user): Explain ChatGPT to a four year old child
ChatGPT: ChatGPT is a computer program that can talk to you. It learns how people talk by reading many texts that people have written. It can answer your questions, tell you jokes, write stories and much more. ChatGPT is like a friend who always has something interesting to say. 😊
Abstractly, these examples share a common pattern: a new text is created from the instructions in the prompt (the specific input or request sent to the AI model). In the process, the intentions are derived from the input text in order to generate an appropriate response.
Prompt (user): In what language is «Chasch mer no d'Suppechelle gäh?» spoken, what ISO language culture name (for example, en-EN) is it, and what does it mean? Answer in Castellano (Spanish)?
The following intentions are derived from the prompt. With Bing Chat - the Microsoft variant of ChatGPT - the detected intentions are displayed directly in the chat (a sketch of how such intent extraction could be approximated follows after this example):
- Searching for: Chasch mer no d'Suppechelle gäh?
- Searching for: Chasch mer no d'Suppechelle gäh? translation to Spanish
- Searching for: Chasch mer no d'Suppechelle gäh? language
- Searching for: Chasch mer no d'Suppechelle gäh? language ISO Code
...and, analogously to ChatGPT, Bing Chat generated the following answer:
Bing Chat: «Chasch mer no d’Suppechelle gäh?» es una frase en suizo alemán, que es un dialecto del alemán hablado en Suiza. El código ISO para el suizo alemán es «de-CH». La traducción al español de esta frase sería «¿Me puedes dar la cuchara de la sopa?».
Translating the answer back into English with DeepL - an AI service for language translation - confirms that the task was passed with flying colors:
DeepL: «Chasch mer no d'Suppechelle gäh?» is an idiom in Swiss German, a dialect of German spoken in Switzerland. The ISO code for Swiss German is «de-CH». The English translation of this phrase would be «Can I have the soup spoon?».
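How Bing Chat derives these intentions internally is not public, but a similar effect can be approximated with a plain chat model by asking it explicitly which search queries it would issue. The following is only a rough sketch of that idea, again assuming the OpenAI client used above:

```python
from openai import OpenAI

client = OpenAI()

prompt = ("In what language is «Chasch mer no d'Suppechelle gäh?» spoken, "
          "what ISO language culture name is it, and what does it mean? Answer in Spanish.")

response = client.chat.completions.create(
    model="gpt-3.5-turbo",  # example model name
    messages=[
        {"role": "system",
         "content": "List the web search queries you would run to answer the user's request, "
                    "one per line. Do not answer the request itself."},
        {"role": "user", "content": prompt},
    ],
    temperature=0,
)

print(response.choices[0].message.content)  # one query per line, similar to the list shown above
```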
Meta prompt (system): You are Janina from customer service and answer customer inquiries. Avoid offensive language, don't quote prices, don't make binding promises. Keep it short but friendly.
Prompt (customer): Hallo! Ich bin mit dem Produkt, das ich letzte Woche erhalten habe, nicht zufrieden.
Note: The customer writes in German («Hello! I am not satisfied with the product I received last week.»). ChatGPT recognizes this and responds in the same language.
ChatGPT: Hallo Peter, es tut mir leid zu hören, dass Sie mit dem Produkt, das Sie letzte Woche erhalten haben, nicht zufrieden sind. Könnten Sie bitte mehr Details über das Problem oder Ihre Bedenken mitteilen? Ich werde mein Bestes tun, um Ihnen zu helfen und auf Ihre Bedenken einzugehen. Mit freundlichen Grüssen, Janina (In English: «Hello Peter, I am sorry to hear that you are not satisfied with the product you received last week. Could you please share more details about the problem or your concerns? I will do my best to help you and address your concerns. Kind regards, Janina»)
In the course of the conversation, the customer switches back to his native English out of sheer annoyance.
Prompt (customer): I want my money back.
The context of the conversation is preserved, and Janina replies in the language of the latest input.
ChatGPT: I am sorry to hear that you are dissatisfied with the product. In order to get your money back, we need to know the exact details and reason for your dissatisfaction. Please provide more information so I can help you further. Yours sincerely, Janina
Prompt Engineering
Importance of Prompt Engineering and the Dangers of Prompt Hacking
Use Cases
Application Scenarios for Large Language Models (LLMs)
Intelligent Document Processing
Intelligent Document Processing extracts specific elements from documents or large texts and processes them further.
Prompt (user): Extract the five best-selling car brands according to units sold from Global Cars Brands.
Looking at the referenced web page as the source, it is clear that the LLM detects the intent very accurately and performs a significant transformation of the information found there (a code sketch of such an extraction follows after the examples).
Bing Chat: The top five selling car brands are:
- Toyota: 9.53 million
- Volkswagen: 9.31 million
- General Motors: 6.83 million
- Hyundai: 6.71 million
- Ford: 4.27 million
Prompt (user): Group car manufacturers by continent from Global Cars Brands.
The AI is able to extract the elements and the grouping from the linked page and provides the following response:
Bing Chat: Here is a possible way to classify car manufacturers by continent:
- Africa: Giad Auto (Sudan), Innoson Vehicle Manufacturing (Nigeria), Kantanka (Ghana), Laraki (Morocco),...
- Asia: Chery, Geely, Great Wall, SAIC-Motor (China), Daihatsu, Honda, Lexus, Mazda, Mitsubishi, Nissan,...
- Europe: Aston Martin, Bentley, Jaguar, Land Rover, Lotus Cars, McLaren, Mini, Rolls-Royce (UK), Audi, BMW,...
- North America: Buick, Cadillac, Chevrolet, Chrysler, Dodge, Ford, GMC, Jeep, Lincoln, Ram Trucks (USA), Acura,...
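A minimal sketch of how such an extraction could be scripted, assuming the page content has already been saved to a local file and reusing the OpenAI client from above; the file name, model name, and JSON schema are illustrative:

```python
import json
from pathlib import Path

from openai import OpenAI

client = OpenAI()

page_text = Path("global_cars_brands.txt").read_text()  # hypothetical local copy of the page

response = client.chat.completions.create(
    model="gpt-3.5-turbo",  # example model name
    messages=[
        {"role": "system",
         "content": "Extract facts from the provided text and answer strictly as a JSON array."},
        {"role": "user",
         "content": "From the following text, extract the five best-selling car brands "
                    "and their units sold as a JSON array of {\"brand\": ..., \"units\": ...}:\n\n"
                    + page_text},
    ],
    temperature=0,
)

# In practice the model may still wrap the JSON in prose, so the output
# usually needs validation before it is processed further.
brands = json.loads(response.choices[0].message.content)
for entry in brands:
    print(entry["brand"], entry["units"])
```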
Customer Service & Operations
The «conversation with a customer» example from above falls into the Customer Service & Operations category. It is a building block for automating a business process or sub-processes. Combining it with other LLM capabilities creates additional benefits for users.
An upstream sentiment analysis, for example, can triage incoming e-mails into those to be answered manually and those to be answered automatically: the more demanding, non-standard customer queries are handled by human intelligence, while the simpler queries are processed by AI.
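A minimal sketch of such a triage step, assuming the same OpenAI client as before; the two labels and the routing logic are illustrative, not a production design:

```python
from openai import OpenAI

client = OpenAI()

def triage_email(body: str) -> str:
    """Ask the model to label an incoming e-mail and route it accordingly."""
    response = client.chat.completions.create(
        model="gpt-3.5-turbo",  # example model name
        messages=[
            {"role": "system",
             "content": "Classify the customer e-mail. Answer with exactly one word: "
                        "SIMPLE (standard request, can be answered automatically) or "
                        "COMPLEX (non-standard or emotionally charged, needs a human)."},
            {"role": "user", "content": body},
        ],
        temperature=0,
    )
    label = response.choices[0].message.content.strip().upper()
    return "auto_reply" if label.startswith("SIMPLE") else "human_queue"

print(triage_email("Hi! What are your opening hours on Saturdays?"))              # likely auto_reply
print(triage_email("I am furious, this is the third faulty delivery in a row!"))  # likely human_queue
```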
Information Discovery & Knowledge Mining
Information Discovery & Knowledge Mining refers to use cases in which, for example, knowledge databases provide the fact base for the LLM. An LLM-based chat over a knowledge database - for instance, a catalog of norms or quality standards - offers a completely new user experience when researching and finding information.
Microsoft leads by example: internally, all product documentation is made available as a chat using Azure OpenAI ChatGPT in conjunction with other services.
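The usual pattern behind such a knowledge-base chat is retrieval augmented generation: relevant passages are looked up first and then handed to the LLM as context. A minimal sketch, with a naive keyword match standing in for a real search index; the documents, the scoring, and the model name are all illustrative:

```python
from openai import OpenAI

client = OpenAI()

# Tiny stand-in for a knowledge database; in practice this would be a search
# index or vector store over norms, quality standards, product documentation, ...
documents = {
    "QS-001": "Incoming goods must be inspected within 24 hours of delivery.",
    "QS-002": "Calibration of measuring devices is required every 12 months.",
}

def retrieve(question: str, k: int = 1) -> list[str]:
    """Naive keyword overlap as a placeholder for proper retrieval."""
    words = set(question.lower().split())
    scored = sorted(documents.items(),
                    key=lambda item: len(words & set(item[1].lower().split())),
                    reverse=True)
    return [f"{doc_id}: {text}" for doc_id, text in scored[:k]]

def answer(question: str) -> str:
    context = "\n".join(retrieve(question))
    response = client.chat.completions.create(
        model="gpt-3.5-turbo",  # example model name
        messages=[
            {"role": "system",
             "content": "Answer only based on the provided excerpts and cite their IDs."},
            {"role": "user", "content": f"Excerpts:\n{context}\n\nQuestion: {question}"},
        ],
        temperature=0,
    )
    return response.choices[0].message.content

print(answer("How often do measuring devices have to be calibrated?"))
```

The point of the retrieval step is that the model answers from the supplied excerpts rather than from its general training data.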
Other categories
Services like GitHub Copilot even generate program code based on an LLM, or explain existing, rather hard-to-read program code to developers.
Transformation
Fundamental Transformation through Generative AI?
The use cases of generative AI described here are a type of weak AI (also known as «narrow» or «applied» AI). Weak AI is focused on specific tasks and can only work within its given framework, which is usually defined by humans. In contrast, strong AI (also known as «general AI» or «artificial general intelligence») refers to systems that possess human-like intelligence and are capable of solving a wide range of cognitive tasks. To date, there is no strong AI, but research in this area is advancing rapidly.
The transformation of the way we work is within reach, as the paragraph above demonstrates: it was created with a prompt to Bing Chat. However, the creative work has only shifted - from writing text to writing prompts and reviewing the generated text. Both activities are still driven by human intelligence.
In summary, it is not everyday life but the way we interact with information systems that is fundamentally changing and will continue to change. Conceivably, there will be a partial shift away from elaborately designed input forms with text fields, buttons, sliders, and options toward interaction via natural language. The latest services that can be added to existing applications and interacted with via natural language support this thesis.