OpenAI’s Whisper model and edited using GPT-4o.
Tonight, I’m going to talk about how we can use artificial intelligence to detect the unusual. If I knew nothing about artificial intelligence, I would have three main questions regarding this title:
Artificial intelligence (AI) is a field of research. It is a subset of mathematics and computer science, and it overlaps with philosophy and linguistics, depending on the specific subset of AI being studied. This interdisciplinary nature makes it a fascinating field because you often collaborate with a diverse group of people across various domains.
AI has been researched since the 1950s, and although we’ve heard a lot about it in media recently, the increase in computing power over the last few decades has enabled us to perform remarkable tasks based on the research conducted during this time.
I have some examples of how we use artificial intelligence:
Search:
Classification: If you have a large dataset containing labeled images of cats and dogs, for instance, you can use AI to assign the correct label to new images as they come in. Google offers “teachable machines” that allows users to train classification models using their webcam, requiring no programming experience.
Generative AI: Tools like ChatGPT for text generation and DALL-E for image generation have gained popularity recently. While impressive, it’s crucial to recognize that generative AI is not the only area of AI research. There are also significant studies in classification, search, and reinforcement learning, for example.
Now that we’ve discussed what artificial intelligence is, it’s also important to clarify what AI isn’t.
First, AI isn’t about creating conscious machines, a notion often propagated in science fiction. While artificial general intelligence—AI with capabilities akin to human intelligence—is a goal for some researchers, most focus on narrow AI tasks. When companies vaguely claim they “use an AI” for significant goals, it raises red flags.
For example, it would be like a toothpaste manufacturer saying they used “a chemistry” to create better toothpaste—it doesn’t quite make sense. It’s important not to fall for the hype surrounding AI in the media. AI has nothing to do with Terminators or humanoid robots; it’s grounded in real-world applications.
I’ve put together some examples of how AI can be misused:
The Smartschool Case: Smartschool is a platform where parents can communicate with their child’s teachers and access their test results. Recently, the company behind the platform announced plans to use AI to predict student dropout risks. This admirable goal (trying to reduce student dropout) was poorly executed, essentially reducing these students to test results and vaguely implementing AI in their product without public dialogue, leading to significant backlash.
Social Media Companies: Social media platforms are designed to maximize user engagement for profit through targeted advertisement. However, this focus can pose risks to younger users: vulnerable youth are presented with content that romanticizes self-harm, for example. Internal Facebook reports revealed that toxic content on Instagram negatively affects teenagers, but the company plays this down in public.
Dutch Child Care Benefit Scandal: Thousands of families in the Netherlands were wrongly accused of fraud when applying for child care benefits, based on algorithmic predictions. This led to severe financial hardships for these families, illustrating how false positives can have disastrous impacts.
Let’s now discuss the unusual and how we detect it.
Everything around us—every process, system, or activity—produces data, which we can collect through observation and measurement. Here are some examples:
Anomalies are deviations from what we consider to be normal behavior, and these deviations leave traces in our observations. We are interested in understanding the underlying causes of these deviations. For example:
It’s essential to distinguish between anomalies and outliers. Some sources claim they are the same, but this is misleading.
Traditional examples of anomalies often show high peaks and obvious outliers in time series data. However, context matters significantly in determining whether a data point is an anomaly. For instance, a large peak in time series data could be normal if it occurs at regular intervals, suggesting that context is critical.
Now that we’ve reached the point of understanding what anomalies are, we can discuss using AI for detecting these unusual patterns.
Anomaly detection refers to finding patterns in data that do not conform to expected behavior. The goal of identifying anomalies is to alert domain experts that something deviates from the norm and to provide actionable information to address the situation.
For instance, in the medical field, detecting arrhythmia would alert healthcare professionals to investigate further.
Detecting anomalies is highly context-dependent. While classification involves labeling items, anomaly detection focuses on recognizing deviations from expected patterns—often rare occurrences.
Traditional approaches to identifying anomalies are rule-based, which can be cumbersome and require constant updating. Relying solely on these rules may lead to missed anomalies or false triggers.
If we would have a manual rule-based configuration for anomaly detection:
In my research, I previously focused on developing a pattern-based series framework. Instead of relying on hard-coded rules, my framework allows us to apply various AI algorithms to learn a model of normal behavior and test new data against it. However, challenges exist, including:
In summary:
Thank you all for your attention. Slides and transcripts are available on my website. If you have any questions, please feel free to ask. Thank you!