Neurosymbolic AI for EGI

Tutorial Abstract

Large Language Models (LLMs) are transforming Natural Language Processing tasks across multiple domains. Despite their capabilities, their real-world adoption is often limited by issues like the lack of transparency, inadequate understanding of domain protocols, and subpar precision. This tutorial introduces the concept of Neurosymbolic AI, which combines symbolic knowledge structures with statistical learning techniques to build more robust, explainable, and instructable LLMs. This tutorial aims to empower participants to deeply understand Neurosymbolic AI applied to Large Language Models (LLMs), addressing key challenges like explainability, grounding, and instructability (EGI).

Specific learning outcomes:

Understand limitations: Grasp the challenges of traditional black-box LLMs and the importance of EGI.
Design LLMs: Learn to create models that integrate process knowledge for better instructability.
Enhance grounding: Develop skills to strengthen LLMs in healthcare applications through reliable explanations.
Personalize responses: Explore methods for tailoring LLM outputs using domain-specific knowledge.
Ensure accountability: Assess outputs with a focus on provenance, reasoning, and transparency.

Schedule

2:00 PM - 3:00 PM: Introduction to Neurosymbolic AI and Knowledge-infused Learning.

3:00 PM - 3:45 PM: Vector Symbolic Architectures.

4:00 PM - 4:45 PM: Grounding Blackbox Language Models with RAG.

4:45 PM - 5:00 PM: Knowledge-infused Learning for Explainability and Instructibility.

5:00 PM - 5:30 PM: OpenCHA: Building Explainable and Personalized Conversational

Tutorial Organizers/Presenters

Deepa Tilwani LinkedIn

Deepa Tilwani is a PhD candidate in Computer Science at the Artificial Intelligence Institute, University of South Carolina, where she specializes in advancing Large Language Models, Artificial Intelligence, and Neurosymbolic AI. Her research lies at the intersection of AI and neuroscience, leveraging cutting-edge natural language processing, signal processing, and biosignal analysis to develop transformative solutions for complex neuroscience challenges. Recognized with prestigious awards and featured in leading publications, her work reflects significant impact and innovation in health care. Driven by a passion for discovery, she actively seeks opportunities to collaborate and contribute to groundbreaking advancements in this rapidly evolving field.

Ali (SeyedAli) Mohammadi LinkedIn Personal Webpage

Ali Mohammadi is a Ph.D. candidate at the University of Maryland, Baltimore County (UMBC), specializing in Explainable Artificial Intelligence, Natural Language Processing, and Knowledge Graphs. He conducts research at UMBC’s Knowledge-Inference and Knowledge-Infused AI Inference Lab. Prior to his doctoral studies, Ali was a lecturer, teaching courses in Image Processing, Artificial Intelligence, and Data Structures. He holds a master’s degree in Artificial Intelligence, further enriching his expertise in the field.

Edward Raff LinkedIn

Edward Raff is the Director of Emerging AI at Booz Allen Hamilton and a visiting professor at the University of Maryland, Baltimore County. Dr. Raff’s research toward solving client problems covers topics in Cyber Security, Reproducibility, Adversarial Machine Learning, High-Performance Computing, and Neuro-Symbolic methods. As such, he has been elected as a senior member of AAAI and the IEEE, published two books, 130+ papers, 6 best-paper awards, and 10+ patents. He has co-chaired the Conference on Applied Machine Learning for Information Security (CAMLIS) three times and co-chaired the AAAI workshop on Cyber Security (AICS) three times, was reproducibility chair of AAAI and a senior program member of multiple AI/ML conferences.

Iman Azimi LinkedIn

Iman Azimi is the assistant director at the UC Irvine - Institute for Future Health and an adjunct professor at the University of Turku. Iman holds a Ph.D. in Information and Communication Technology from the University of Turku and an MSc in Artificial Intelligence and Robotics from Sapienza University of Rome. His research spans machine learning, generative AI, and health data analysis. He works on multiple digital health projects, focusing on conversational health agents, mental health, maternal health and nutrition monitoring, and personalized data analytics utilizing AI techniques on diverse data sources, such as wearable devices.

Aman Chadha LinkedIn

Aman Chadha leads a team of Generative AI Scientists and Managers at AWS. Previously, he spearheaded Speaker Understanding & Personalization efforts at Amazon Alexa AI. Prior to Amazon, Aman was a key contributor to the Machine Intelligence Neural Design (MIND) team at Apple, where he trained on-device multimodal AI models for applications spanning Natural Language Processing, Computer Vision, and Speech Recognition. As one of the architects behind Apple's M1 chip, he developed machine learning models to predict the performance of future Macs years in advance. Before Apple, Aman honed his expertise in ML accelerators and GPUs during his tenure at Qualcomm and Nvidia. Aman specialized in Multimodal AI at Stanford University. He holds a Master’s in Computer Engineering from the University of Wisconsin-Madison, where he received the Outstanding Graduate Student Award, and a Bachelor’s in Electronics and Telecommunication Engineering with distinction from the University of Mumbai. He has published in leading conferences, such as ACL, EMNLP (Outstanding Paper Award '23), AAAI, EACL, ECIR, ECML, WSDM, WACV, ICASSP, etc. His work has been featured in outlets like The Washington Post, New Scientist, Analytics India Magazine, etc.

Manas Gaur LinkedIn

Manas Gaur is an assistant professor in the Department of Computer Science and Electrical Engineering at the University of Maryland Baltimore County. He leads the Knowledge-infused AI Inference Lab focusing on NeuroSymbolic AI, Explainable AI, Safe AI, Knowledge-infused Learning, Large Language Models, and Knowledge Graphs, with applications to mental health, cybersecurity, crisis informatics, and conversational systems. Previously he was a senior research scientist at Samsung Research America and a visiting researcher at Alan Turing Institute UK. He was an AI for Social Good Fellow at Dataminr Inc. and Eric Wendy Schmidt Data Science for Social Good Fellow at the University of Chicago. His research has received the best paper award in IEEE Internet Computing, IEEE Intelligent Systems and an honorable mention award at ACM CoDS COMAD. He was selected for AAAI New Faculty Highlights, and USC was awarded the Eminent Doctoral Profile award. He has been a guest editor on NeuroSymbolic AI and Large Language Models in IEEE Internet Computing and ACM Transactions on Computing for Health. He holds senior PC member or area chair for WWW, KDD, CIKM, AAAI, and ACL. He is Co-Chair of the International Semantic Web Conference. He has organized the first Tutorial on Knowledge-infused Learning (AAAI, ACM Hypertext, and Social Computing), Explainable AI using Knowledge Graphs (AI-ML Systems, KDD).

Tutorial Audience

This tutorial is designed to appeal to a broad audience, from graduate students seeking a foundational understanding of Neurosymbolic AI to industry professionals exploring its practical applications. Graduate students will benefit from a comprehensive introduction to the field, while faculty members can delve into cutting-edge research on knowledge graph-driven generative AI, particularly in healthcare. Industry researchers will gain valuable insights into grounding, instructability, and explainability in agents, and a hands-on demonstration will showcase how to integrate Neurosymbolic AI into real-world scenarios.

Prerequisites

While this tutorial is designed to be accessible, a foundational understanding of the following topics is recommended to maximize comprehension and engagement:

Probability and Statistics: Familiarity with basic probability concepts and statistical analysis methods.
Artificial Intelligence (AI): A working knowledge of AI principles, especially those related to language models and neural networks.
Knowledge Graphs: Understanding the structure and application of knowledge graphs in AI systems.
Machine Learning: Experience with machine learning concepts, including supervised and unsupervised learning techniques.
Data Mining: Basic insights into extracting patterns and knowledge from large datasets.

For those new to these areas, the tutorial will include a foundational overview and focus on providing intuitive explanations of complex concepts. Advanced topics will be presented with an emphasis on accessibility, ensuring participants of varied expertise can follow along and benefit from the session.

Takeaways

While grounding in AI is often associated with multimodality, it encompasses a broader concept [1]. Grounding refers to ensuring that an AI system's understanding is firmly rooted in domain-specific knowledge, guidelines, and expertise [2]. This is crucial for preventing superficial responses that often plague current LLMs [3]. By delving into the techniques and strategies for achieving stronger grounding, participants will learn how to:

Anchor AI models in factual information through guidelines, graphs, and domain knowledge bases [4]
Align AI responses with specific contexts and not provide an inconsistent and biased response
Mitigate the risk of AI generating inaccurate or misleading information through attribution

Slides

NeuroSymbolic AI for Grounding, Instructibility, and Explainability

References

Antonio Nucci. LLM Grounding: Techniques to Amplify AI Model Accuracy --- aisera.com. aisera.com/blog/llm-grounding, [Accessed 17-01-2025]
Bajaj G, Shalin VL, Parthasarathy S, Sheth A. Grounding From an AI and Cognitive Science Lens. IEEE Intelligent Systems. 2024 Apr 30;39(2):66-71.
Biderman S, Prashanth U, Sutawika L, Schoelkopf H, Anthony Q, Purohit S, Raff E. Emergent and predictable memorization in large language models. Advances in Neural Information Processing Systems. 2024 Feb 13;36.
Grounding in Large Language Models (LLMs) and AI | Generative AI Wiki --- attri.ai. https://attri.ai/generative-ai-wiki/grounding-in-large-language-models-llms-and-ai, [Accessed 17-01-2025]
Seyedali Mohammadi, Edward Raff, Jinendra Malekar, Vedant Palit, Francis Ferraro, and Manas Gaur. 2024. WellDunn: On the Robustness and Explainability of Language Models and Large Language Models in Identifying Wellness Dimensions . In Proceedings of the 7th BlackboxNLP Workshop: Analyzing and Interpreting Neural Networks for NLP, pages 364–388, Miami, Florida, US. Association for Computational Linguistics.
Deepa Tilwani, Yash Saxena, Ali Mohammadi, Edward Raff, Amit Sheth, Srinivasan Parthasarathy, and Manas Gaur. 2024. REASONS: A Benchmark for REtrieval and Automated CitationS Of scieNtific Sentences using Public and Proprietary LLMs . arXiv preprint, arXiv:2405.02228.

Tutorial: Neurosymbolic AI for EGI