This article introduces a new concept: speech mining. It shifts the focus from the traditional, technology-centered view of process mining to an approach centered on people, using real-time conversations to reconstruct process descriptions. A practical example, documenting an assembly process, shows how ChatGPT4 can produce precise assembly instructions during execution by combining speech-to-text technology with image assignment.
The birth of a concept
Traditional process mining analyzes the digital traces that employees leave behind in a company's IT systems and treats processes as purely technical. But what happens when the focus is on the spoken word rather than on digital traces? This is where a new concept comes in, one that combines speech analytics and audio mining: speech mining.
You've never heard of speech mining? No wonder, because, strictly speaking, it doesn't even exist yet! Rather, it is a vision of how AI could revolutionize process modeling in the future. But first things first: what exactly does speech mining mean?
Speech analytics, also known as interaction analysis, is a technology that uses artificial intelligence to understand, process, and analyze human language. Audio mining describes the systematic extraction of information from audio sources, in other words, gaining important information by listening to spoken language. The combination of the two creates a new concept, speech mining, which links human interaction with efficient process design.
In contrast to classic process mining, speech mining focuses on the spoken word. It is a method for capturing the current state of a process or object. Human interaction is used in real time to describe processes, generate new content, and create initial models. The approach is based on extracting process-relevant information from spoken communication.
Practice-oriented speech mining
To put theory into practice, the following example provides a vivid insight into the application of speech mining:
Same use case, different setting: Why not also record structured conversations about a process using speech-to-text? This approach lets several people create assembly instructions collaboratively, each describing individual steps in detail. In contrast to traditional methods, where one person describes the entire process, different participants can describe different parts of the process in this setting. One stage further: Imagine how cool it would be if a group of people talked loosely about a process without sticking to a fixed structure, and you then let ChatGPT create a coherent and meaningful sequence from all these steps.
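As a minimal sketch of this collaborative setting, the following Python snippet collects speech-to-text segments from several speakers and turns them into a single prompt that could be handed to a language model such as ChatGPT. Everything in it is an assumption made for illustration: the `Utterance` structure, the `build_ordering_prompt` function, and the sample sentences are hypothetical, and no real speech-to-text or ChatGPT interface is called.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class Utterance:
    """One transcribed speech segment (hypothetical structure)."""
    speaker: str
    start_seconds: float  # when the segment began in the recording
    text: str


def build_ordering_prompt(utterances: list[Utterance]) -> str:
    """Combine loosely ordered statements into one prompt for a language model.

    The statements are listed in the order they were spoken; working out the
    logical order of the process steps is left to the model.
    """
    spoken_order = sorted(utterances, key=lambda u: u.start_seconds)
    lines = [f"- {u.speaker}: {u.text.strip()}" for u in spoken_order]
    instruction = (
        "Several people described parts of an assembly process. "
        "Arrange their statements into a coherent, numbered sequence of steps."
    )
    return instruction + "\n\n" + "\n".join(lines)


if __name__ == "__main__":
    # Invented sample conversation: the steps are mentioned out of order.
    conversation = [
        Utterance("Ben", 12.5, "At the end we tighten all four screws."),
        Utterance("Anna", 3.0, "At the very start the base plate goes on the table."),
        Utterance("Chris", 7.2, "The bracket is placed on the base plate before that."),
    ]
    print(build_ordering_prompt(conversation))
```

The resulting text would then be passed to the language model of your choice. How that call looks depends on the tool you use and is deliberately left out of this sketch.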
The company as a living 'digital twin'
Imagine being able to iteratively design a dynamic process landscape for your company, including process models, work instructions, and goals, driven by leadership impulses. This evolving “digital twin” is initially based on generic input data but becomes more and more specific as it is continuously adjusted to current leadership impulses. The approach aims to reduce generic management documentation and to create a flexible structure that reacts to change. You can also record conversations about processes in your organization and thus keep your digital twin up to date with discussions and decisions. It also makes sense to integrate classic process mining into the digital twin. The result is a living system, because you can ask the digital twin questions at any time, such as: “Can I buy the pencil myself or do I need approval for it?” and receive the correct answer every time. In the background, a clearly structured, human-readable process landscape remains in place.
A promising future
Although the topic is still in its infancy, it is already apparent that speech mining can be used successfully. The possibilities are varied, and the momentum that speech mining brings to the world of process modeling is impressive. How this field will develop remains an exciting question, but one thing is certain: the great potential of speech mining will decisively shape the way we understand, document, and communicate processes.
Since 2009, Modell Aachen GmbH has stood for interactive management systems based on wiki technology. With software and management consulting, we support our customers on their way to process-oriented corporate management and lightweight knowledge management. With our Modell Aachen Insights blog, we share our knowledge about interactive management systems, process management, and quality management with you.