Unstructured Data: Bridging Social Sciences & NLP/LLM Research Thru Open Science

Thursday 5 September 2024

This event has finished.

Started 12:05 PM

Finished 12:35 PM

Organized by ODSC Lisbon Data Science

Venue: Online/Virtual

Address: Online event on your device

About this event

**Topic:** Unlocking Unstructured Data: Bridging Social (Survey) Sciences and NLP/LLM Research Through Open Science

**Speaker:** Prof. Frauke Kreuter / Chair of Statistics and Data Science / LMU Munich

*Frauke Kreuter is Professor of Statistics and Data Science in Social Sciences and the Humanities at the Ludwig-Maximilians-University of Munich, Germany; Co-Director of the Social Data Science Center (SoDa) and faculty member in the Joint Program in Survey Methodology (JPSM) at the University of Maryland, USA; and, until recently, head of the statistical methods group at the Institute for Employment Research (IAB) in Nuremberg, Germany. She is an elected fellow of the American Statistical Association and the 2020 recipient of the Warren Mitofsky Innovators Award of the American Association for Public Opinion Research. In addition to her academic work, Dr. Kreuter is the Founder of the International Program for Survey and Data Science, developed in response to increasing demand from researchers and practitioners for appropriate methods and tools to face a changing data environment; Co-Founder of the Coleridge Initiative, whose goal is to accelerate data-driven research and policy around human beings and their interactions for program management, policy development, and scholarly purposes by enabling efficient, effective, and secure access to sensitive data about society and the economy; and Co-Founder of the German-language podcast Dig Deep.*


The vast availability of unstructured data presents a significant opportunity for the social sciences, yet there is a pressing need for better tools and infrastructure to access and use this data effectively. This talk will highlight how the Business and Economic Research Data Infrastructure Program, BERD@NFDI, is addressing these needs, showcasing achievements and inviting further collaboration within the European social science community.

Simultaneously, the fields of Natural Language Processing (NLP) and Large Language Models (LLMs) require high-quality training data. Social scientists have been collecting valuable data for decades, which can serve as essential benchmarks for advancing NLP and LLM research. By embracing open science, we can bridge the gap between social science and computational research, making this data more accessible and fostering collaboration across disciplines.

This page last updated Tuesday 3 September 2024 at 23:43.
