The next expert meeting of the NLP group will be held on 1. October 2024 at the usual location – ZHAW on Lagerstrasse – followed by an Apero at Nüü. This session will focus on the topic of “Contextualization,” featuring the following speakers:
Hans Ramsl from Weights and Biases will discuss “Beyond Vibe Checks: Establishing Systematic Evaluation Frameworks for Semantic Contextualization in LLMs”.
Tobias Schimanski, PhD candidate at the University of Zurich and affiliated researcher at the University of Oxford, is going to talk about “Challenges and Pathways for Retrieval Augmented Generation” by looking into two of his recent papers.
TBA
Join us on October 1st at 17:30 at Lagerstrasse 45, 8004 Zurich, in room ZL O6.16 (6th floor). The presentations will run from 17:30 to 19:00, followed by an Apero downstairs at 19:00.
Online participation is also possible.
Please use the following form to confirm your attendance by September 24:
SwissText is an annual conference that brings together text analytics experts from industry and academia. It is organized by the Swiss Association for Natural Language Processing (SwissNLP) in collaboration with local universities and the Zurich University of Applied Sciences (ZHAW).
Registration is open from the first of March. Early Bird Tickets are available until the fifth of May.
Swisstext is organised jointly by the University of Applied Sciences of the Grisons (FHGR) and the Zurich University of Applied Sciences (ZHAW) with the support of the Swiss Innovation Agency (Innosuisse) and several industrial sponsors.
General Chair: Prof. Corsin Capol (FHGR), Prof. Dr. Mark Cieliebak (ZHAW) Program Chair: Prof. Dr. habil. Albert Weichselbraun (FHGR) Scientific Track Chair: Dr. Claudiu Musat (Google) Workshop Chair: Dr. Elisabeth Maier (Karakun AG) Conference Happiness Manager: Lucas Zimmermann (FHGR) Organizational Memory: Manuela Hürlimann (ZHAW)
The next Expert Meeting of the NLP group on 9 April 2024, 17:30-19:00, will focus on “LLMs in Practice” and includes the following speakers:
Flurin Gishamer, Senior Data Scientist at Open Systems Title: “Harnessing Large Language Models for Augmenting Managed Service Operations”
Marcel Neidinger, Solution Architect at Amazon Web Services (AWS) Title: “Patterns and Lessons Learned from Building GenAI Applications in the Wild”
Arben Sabani, Language Technology Software Developer at SuperText Topic: “Leveraging LLMs for Complex Task Solving in Agent Systems”
It will take place at the ZHAW premises in Lagerstrasse 45, 8004 Zurich in room ZL O3.08 (3rd floor). The meeting will be followed by an apéro.
Online participation is also possible.
Please use the following form to confirm your attendance by March 31: https://forms.gle/iPk6PfFiWa58xz1j7. We will then send you a calendar invitation which also includes online participation details. (Note that the registration form also contains information on the SwissNLP General Assembly, which takes place right before the Expert Meeting, but this is only relevant to members of SwissNLP.)
Our next Expert Meeting on Thursday, 30 November 2023, 17:00-18:30, will focus on various aspects and applications of NLP & Health
It will take place at the ZHAW premises in Lagerstrasse 45, 8004 Zurich in room ZL O6.12 (6th floor). The meeting will be followed by an apéro.
Online participation is also possible.
Please use the following form to confirm your attendance by October 30: https://forms.gle/eEYLWjpGNKLauGmF8 We will then send you a calendar invitation which includes online participation details.
In the form, you also have the opportunity to let us know which topics you are interested in for our meetings in 2024 and to suggest speakers!
The following presentations are confirmed for the meeting on 30 November:
Elif Ozkirimli, Head of Computational Science Products at Roche Title: Adoption of NLP in Healthcare: a Strategic Perspective Across the Pharma Value Chain.
Matteo Manica, Senior Research Scientist at IBM Title: Harnessing the Power of Language Models to Accelerate Material Design
Ahmad Aghaebrahimian, Associate Researcher at ZHAWTitle: Medical Informatics Powered by Large Language Models and the Semantic Web; A work in progress
Nicolas Löffler-Perez, Data Scientist at SwissmedicTitle: Medicrawl: a ML-based Application for Finding Illegal Products in Online Markets.
We would like to invite you to the GEOSummit open innovation workshop on “Open Data Value Creation”, hosted by ERNI in Zurich.
Creating value based on open data is difficult, as topics such as harmonization of data formats, automatic data lineages or the standardization of data linkage processes are technically challenging and also not unproblematic from a regulatory point of view. But even supposedly simpler steps such as contextualized data search and raising awareness of the availability of open data in general have only been implemented to a very limited extent.
In this workshop, we would like to invite stakeholders from research, industry, NGOs and the public sector to work on solutions and to define concrete steps on how to improve value creation through open data. Dr. Jürg Meierhofer (ZHAW) will be our moderator and guarantees a result-oriented, structured approach to the workshop.
As thematic use-case we will focus on the energy sector. This includes, for example, the analysis of consumption data collected by sensors and smart meters – or, more generally, exploiting the potential of energy management systems due to the increasing electrification. Especially with regard to energy savings (e.g. usage-specific lighting scenarios), these so-called IoT recommender systems have the potential to deliver great added value. These areas are not detached from the building shell, which brings us back to the various smart building & construction technologies and of course to the electricity providers (e.g. with regard to dynamic tariffs). in this context, we discuss topics such as data protection, data infrastructure and data management, open platform ideas for data sharing and trading and the smart services that could be built on them.
Free participation, but registration is necessary: Link
Wir möchten Sie zum GEOSummit Open Innovation Workshop “Open Data Value Creation” einladen, der von ERNI in Zürich veranstaltet wird.
Die Wertschöpfung auf der Basis offener Daten ist schwierig, denn Themen wie die Harmonisierung von Datenformaten, automatische Datenabgleiche oder die Standardisierung von Datenverknüpfungsprozessen sind technisch anspruchsvoll und auch aus regulatorischer Sicht nicht unproblematisch. Aber auch vermeintlich einfachere Schritte wie die kontextualisierte Datensuche und die Sensibilisierung für die Verfügbarkeit offener Daten im Allgemeinen wurden bisher nur in sehr geringem Umfang umgesetzt.
In diesem Workshop möchten wir Akteure aus Forschung, Industrie, Nichtregierungsorganisationen und dem öffentlichen Sektor einladen, an Lösungen zu arbeiten und konkrete Schritte zu definieren, wie die Wertschöpfung durch offene Daten verbessert werden kann. Dr. Jürg Meierhofer (ZHAW) ist unser Moderator und garantiert einen ergebnisorientierten, strukturierten Ansatz für den Workshop.
Als ein mögliches Anwendungsgebiet steht hierbei der Energiesektor im Fokus. Dazu gehört z.B. die Analyse von Verbrauchsdaten, die durch Sensoren und Smart Meter gesammelt werden – oder ganz allgemein die Nutzung des Potenzials von Energiemanagementsystemen aufgrund der zunehmenden Elektrifizierung. Gerade im Hinblick auf Energieeinsparungen (z.B. nutzungsspezifische Beleuchtungsszenarien) haben diese sogenannten IoT-Recommender-Systeme das Potenzial, einen großen Mehrwert zu liefern. Diese Bereiche sind nicht losgelöst von der Gebäudehülle, womit wir wieder bei den verschiedenen intelligenten Gebäude- und Bautechnologien und natürlich bei den Stromanbietenden Organisationen sind (z. B. im Hinblick auf dynamische Tarife) sind. In diesem Zusammenhang diskutieren wir Themen wie Datenschutz, Dateninfrastruktur und Datenmanagement, Ideen für offene Plattformen zum Datenaustausch und -handel und mögliche, darauf aufbauende “intelligente” Dienste.
Our next Expert Meeting on Monday, 21 August 2023, 17:00-18:30, will focus on various aspects and applications of Large Language Models.
It will take place at the ZHAW premises in Lagerstrasse 45, 8004 Zurich in room ZL O3.01 on the third floor. The meeting will be followed by an apéro. Online participation is also possible.
We will then send you a calendar invitation which includes online participation details.
The following presentations are confirmed for the meeting:
Kim Engels, Converto AG – Large Language Models for Cross-Media Marketing
In this talk, Kim briefly presents some examples of how he and his team at Converto use AI and LLM to improve or speed up their projects.
Besides approaches such as text generation for newsletters, there are also variants such as code generation within the team as well as the use of self-developed solutions to create 3D models for customer campaigns.
Florian Tramér, ETH – Are Aligned Neural Networks Adversarially Aligned?
Large language models are now tuned to align with the goals of their creators, namely to be “helpful and harmless.” These models should respond helpfully to user questions, but refuse to answer requests that could cause harm. However, adversarial users can construct inputs which circumvent attempts at alignment. In this talk, we’ll discuss to what extent these models remain aligned, even when interacting with an adversarial user who constructs worst-case inputs (adversarial examples). We’ll see that existing optimization attacks are insufficiently powerful to reliably attack aligned text models, except when these models are multimodal (i.e., they can process both text and images). In that case, we show these models can be easily attacked, i.e., induced to perform arbitrary un-aligned behavior through adversarial perturbation of the input image.
Alex Paramythis, Contexity AG – Adapting Large Language Models for Customer Request Handling
With the rise of Generative Large Language Models (LLM), companies are looking into the many opportunities proffered by this new technology. One area of particular interest is the automated handling of customer requests (e.g., received through email, chat, social media, etc.) using the institutional knowledge at hand. In such a context, LLMs may need to be trained on, or have access to, privileged, non-public information in the company’s knowledge base. This, in turn, entails that the models need to be prepared within, and served from, a company’s own infrastructure to prevent information leakage — a requirement that points in the direction of commercially friendly open-source models. In this talk we will present our work on generation of responses to customer requests using the IGEL (a BLOOM based model), FLAN-UL2, and Falcon LLMs. For the first two models we will also report on our attempts to fine-tune the models before use, with a variety of training data.
This Expert Meeting will take place at the ZHAW premises in Lagerstrasse 45, 8004 Zurich in room ZL O3.01 on the third floor (online participation is also possible for those who prefer this option) on Wednesday, May 10 from 17:30-19:00. After the meeting, there will be an apéro so that you can carry on your discussions and get to know each other.
We have the following two talks confirmed:
End-to-end ASR for Swiss German at Microsoft: A Transducer Approach Oscar Koller, Applied Scientist at Microsoft
Automatic speech recognition (ASR) for Swiss German is a challenging task due to the lack of a standardized writing system and the high regional variability of the dialects. In this talk, we present our work on developing end-to-end ASR models for Swiss German at Microsoft using transducer architectures. We show that transducers outperform hybrid models by over 20% in word error rate on a multi-dialectal corpus of Swiss German speech. We also compare our models to Whisper, a state-of-the-art sequence-to-sequence model for low-resource ASR, and find that transducer models achieve comparable results with much smaller model size and training time. Finally, we discuss how end-to-end models produce transliterations of Swiss German words instead of standard German translations affecting the readability and usability of the output and propose solutions to this problem.
Revolutionizing Natural Interaction with Swiss German: A Glimpse into the Future of Conversational AI Claudio Paonessa and Yanick Schraner, Researchers at FHNW
Get ready for a glimpse into the future of natural interaction with computer systems in Swiss German! We leveraged the latest advancements in speech-to-text and text-to-speech technology to create an engaging and interactive experience that showcases the results of our cutting-edge research.
Exploring the Acceptance of Intelligent Voice Assistants in Home Care Applications: Opportunities and Obstacles [10 mins presentation, 10 mins discussion] Edith Birrer, Researcher at iHomeLab – HSLU (Hochschule Luzern)
In the scope of co-creation sessions, care workers provided insights on applications and on concerns about Intelligent Voice Assistants (IVA) in the home of their clients or patients. The sessions focused on the potential to support the care documentation process by IVA. Participants’ expectations and worries spanned from the ability to handle dialects, to confidentiality issues, to integration in existing care documentation systems. However, there is a general openness toward the idea to employ IVA as means to improve the quality of care. The challenge foreseen for using IVA is to become as time efficient as care documentation systems in place. Alternatively, as suggested by participants, IVA could complement existing processes or even create new ones in the care context.
If you want to join, please fill in the following registration form by April 27: https://forms.gle/PmRQENtY8aybJeby5 Please note that the registration form includes information for the SwissNLP General Assembly which is co-located.
The Data Innovation Alliance’s second Expert Day in March 2023 was a hub of activity as experts from four key areas – Smart Maintenance, NLP & AI Technology, Spatial Data, and Smart Services – gathered to share their insights and mingle with researchers and industry professionals. The event kicked off with leaders from each Expert Group pre-discussing their plans for 2023, generating a wealth of innovative ideas for joint events and initiatives, and paving the way for exciting collaborations in the (near) future.
But that’s not all! The NLP and Digital Health groups are teaming up to bring you joint events that will revolutionize the way we approach data. And with the next Expert Day set for August 2023, featuring four expert groups once again, get ready for even more ground-breaking discussions and initiatives, organized jointly with other Innovation Boosters. Keep an eye on our events calendar for more information.
While the keynote speech may not have met expectations in terms of insights, it set the stage for what was to come – dynamic discussions and collaborations in the expert group break sessions. To ensure everyone had access to the wealth of information shared, short summaries of the discussions were written by participants in each room.
In short, the second Expert Day was a superb success, bringing together a diverse group of experts to debate their ideas and shape the future of data innovation.
Smart Services for Sustainability – Circular Servitization by Jürg Meierhofer
The Smart Services for Sustainability – Circular Servitization discussion was a dynamic conversation among highly experienced individuals from different industries. They explored how value is created in business ecosystems, focusing on both individual and organizational perspectives.
It was inspiring to have diverse industry representatives in the same room and to create a common understanding. Departing from economic value creation, the group extended its scope to ecological factors. An intense discussion arose about how environmental value can be created without negatively impacting economic value. Statements that economic value creation is still the predominant requirement were made, meaning that in many cases, even a slight reduction of economic value for the sake of ecological value would be treated with suspicion. As sustainability becomes increasingly relevant and regulations loom, the balance between economic and ecological value may shift in the near future.
Overall, the Smart Services for Sustainability – Circular Servitization discussion was thought-provoking and left participants eager to continue exploring the intersection of business and sustainability.
Spatial Data by Reik Leiterer
In a room buzzing with ideas, each data expert chimed into the discussion about the creation of a platform that would benefit cantons, individuals, and service providers. There was a shared understanding that it might not be possible to cater to everyone’s needs and that a simpler visualization and analytics approach may be the way forward. However, some uncertainties still remained, such as identifying where the necessary data is available and how it can be integrated, setting limits, and ensuring that data is not misinterpreted. Despite these challenges, the group remained enthusiastic about the potential benefits of the platform and is looking forward to overcoming these obstacles.
NLP & AI Technology by Lina Scarborough
The group opened the floor with how chatbots are great to answer questions, but what happens when users don’t know where to begin asking questions? This is a common issue in legal situations where the average client may not have the necessary background to understand what information is needed. Retrieval augmented language models like KATIE have emerged as a solution to this problem. These models use grounded reasoning and promote a chain of thought to handle complex queries and create a context for users who may not know what subset of questions to ask.
With the rise of machine-generated text, it’s becoming more difficult to distinguish between human and machine-generated content. While probabilistic token selection and frameworks like SCARECROW can help scrutinize machine-generated text, it can still be difficult, to nigh impossible, to identify. However, ChatGPTZero, an app that uses watermarking to create a statistical fingerprint in the sampling method, claims to be able to detect whether an essay is written by ChatGPT or a human – for instance, ChatGPT generally makes redundancy errors whereas humans make grammatical mistakes. This approach hopes to maintain the integrity of human-generated content in the face of increased machine-generated text.
The discussion then flowed into a lively and engaging presentation on how AI technology can make the tricky SQL “minefield” as easy to navigate as a soccer player scoring a goal – literally, by demonstrating SQL prompts on the soccer World Cup!
Smart Maintenance by Melanie Geiger
The five use case presentations highlighted the versatility of data technology in different applications, showcasing how it can be adapted to meet various needs. With input data ranging from domain knowledge to error log data, these use cases demonstrated how AI models can process and analyze complex data sets to provide valuable insights and decision support.
One of the key themes that emerged was the use of AI for diverse condition-based maintenance, specifically anomaly detection and fault diagnosis. By leveraging ML algorithms, these use cases were able to detect potential issues and predict equipment failures for timely maintenance and preventing downtime.
The highlight of the event was not only the apèro treats, but the opportunity to engage with the 60 participants and learn about their projects, challenges, solutions, and ideas for collaboration. Many attendees seemed to share this sentiment, as numerous participants were still engrossed in conversation at the end of the event, and some discussions had to be continued elsewhere. Those who wish to follow up on these conversations have the option to do so at SDS2023. On a more lowkey note, maybe you wanted to add someone on LinkedIn and send them a message. Here you go, this is your reminder!
Our conclusion of the event: the Alliance has many experts in various subtopics of data-driven value creation, but only together we can move faster.
We invite you to the second iteration of the Expert Day. Join us in an exchange of expertise and find inspiration. These following groups will participate:
Natural Language Processing & Big Data Technologies
Smart Maintenance
Smart Services
Spatial Data Analytics
Detailed Program:
15:00 – Welcome 15:30 – Keynote by Prof. Pierre Dersin 16:10 – Expert Group Meetings in breakout rooms (see below) 17:40 – Apéro
Data-driven Value Added for Words, Images and Things
Digital transformation is a defining feature of our epoch.
Abundance of data, immense increase in hardware processing capabilities and breakthroughs in analytics algorithms have made practical some of the visions put forward about three quarters of a century ago. The branch of Artificial Intelligence called Machine Learning, and in particular Deep Learning, permeates image processing, natural language processing ( “ words”) and smart maintenance (‘things’), and furthermore enables rich synergies between those three fields, which span a great deal of human activity, with profound potential impacts—some already visible, on industry, science, the arts and social life.
Natural Language Processing & Big Data Technologies
Everyone is talking about ChatGPT these days and some of its output is truly impressive! We will discuss how the most recent wave of text generation algorithms can transform business, science and teaching. The meeting will feature the following expert talks:
Grounded Copywriting with ChatGPT & Co by Michael Wechner (Wyona AG) + Colin Carter (Coop Rechtsschutz) Everyone talks about the pros and cons of ChatGPT, its competitors and how to combine the generated text with grounded knowledge. We will demonstrate how ChatGPT & Co can be applied in insurance and discuss the future of retrieval augmented language models.
Can we Identify Machine-Generated Text? An Overview of Current Approaches by Anastassia Shaitarova (UZH Institute for Computational Linguistics)
The detection of machine-generated text has become increasingly important due to the prevalence of automated content generation and its potential for misuse. In this talk, we will discuss the motivation for automatic detection of generated text. We will present the currently available methods, including feature-based classification as a “first line-of-defense.” We will provide an overview of the detection tools that have been made available so far and discuss their limitations. Finally, we will reflect on some open problems associated with the automatic discrimination of generated texts.
Using AI to Query the Football World Cup Database in Natural Language by Kurt Stockinger (ZHAW Institute for Applied Information Technology)
Football is one of the most popular sports on earth with millions of people watching the FIFA world cup. In this talk, we describe how we built a system to query the world cup database in natural language. We explain how we translate natural language into the database query language SQL using modern transformer architecture. We also demonstrate how we have used large language models such as Open AI’s GPT-3 and Google’s T5 to explain how the system interprets users’ questions.
We are looking forward to exchanging opinions, experiences and questions, and to exploring this exciting field together!
Smart Maintenance
The value of condition monitoring data: 5 use cases.
In this meeting of the Smart Maintenance Expert Group we will hear about successful student projects conducted together with industry partners from various fields. The focus points of the projects are very diverse, ranging from prediction of energy losses, through anomaly detection, fault diagnostics, prediction of the remaining useful life and optimal maintenance scheduling. We will have 5 short pitch presentations, followed by an interactive discussion of future interest topics of our expert group, including active feedback of all participants.
Anomaly Detection in Marine Engines with Convolutional Neural Networks (Company: WinGD)
Aircraft Scheduling Optimization based on Prognostics Degradation Models (Company: Swiss International Airlines)
Modeling Wake Energy Losses in Wind Farms using Graph Neural Networks (Company: Fluence Energy)
Using Error Code Patterns to Predict Service Requests on Production Machines with Machine Learning (Company: Zünd Systemtechnik).
Fault Detection in Solar Power Plants using Physics Informed Deep Learning (Company: Fluence Energy)
Smart services for sustainability – circular servitization
With data-driven services, industrial companies can create quantifiable value for their customers, partners and themselves. At the same time, these services also have the potential for ecological benefits, e.g., through optimized processes in operations or logistics. To make this possible, economic and ecological goals must be captured in a targeted and combined manner when designing the services.
The 1.5-hour workshop will discuss how specific problems from everyday business can be systematically addressed to create relevant added value for business and ecology. Participants will bring their own business issue and leave the workshop with a first approach on how to create economic and environmental value through smart services. The workshop will run through typical phases of a project in a compressed time format to give an impression of what such a project might look like on a larger scale.
Spatial Data Analytics
High-quality spatial data is increasingly available for free use. However, with the large amount of data and the sometimes very specific data types and formats, it is challenging to find the appropriate data sources. In addition, some of the data access platforms are only partially intuitive and can be used without expert knowledge. Accordingly, the question arises whether the full potential of the available data base could not be better exploited if data access and data sharing were simplified. In this co-creation workshop, concepts and approaches will be reflected and discussed with representatives from research and industry as well as from cantonal and federal agencies, with the aim of developing possible approaches for joint implementation.
Natural Language Processing (NLP) offers fundamental solutions for tasks such as text classification, automated chatbots, text summarization or speech recognition. How can the Insurance industry benefit from these AI technologies? This is the core question for the upcoming Expert Group Meeting “NLP in Insurance”, which will take place on Montag 31.10.2022 at 16:30 at ZHAW Lagerstrasse, Zurich.
Felix Müller, Senior Data Scientist at Mobiliar, will share insights into their usage of NLP for different applications (e.g. using transformers for claim handling). This will be followed be an open discussion among all participants (experts and users from academia and industry) and a nice apero.
The meeting is jointly organized by the NLP Expert Group, the Expert Group AI in Finance and Insurance, and the Swiss Association for Natural Language Processing (SwissNLP).
To provide the best experiences, we use technologies like cookies to store and/or access device information. Consenting to these technologies will allow us to process data such as browsing behavior or unique IDs on this site. Not consenting or withdrawing consent, may adversely affect certain features and functions.
Functional
Always active
The technical storage or access is strictly necessary for the legitimate purpose of enabling the use of a specific service explicitly requested by the subscriber or user, or for the sole purpose of carrying out the transmission of a communication over an electronic communications network.
Preferences
The technical storage or access is necessary for the legitimate purpose of storing preferences that are not requested by the subscriber or user.
Statistics
The technical storage or access that is used exclusively for statistical purposes.The technical storage or access that is used exclusively for anonymous statistical purposes. Without a subpoena, voluntary compliance on the part of your Internet Service Provider, or additional records from a third party, information stored or retrieved for this purpose alone cannot usually be used to identify you.
Marketing
The technical storage or access is required to create user profiles to send advertising, or to track the user on a website or across several websites for similar marketing purposes.