Latest revision as of 21:11, 17 January 2023

Location: Hybrid - Buchanan 126, and the zoom link below

Time: Wednesdays at 10:30am, Mountain Time

Zoom link: https://cuboulder.zoom.us/j/97014876908


Date	Title
24.08.22	Planning, introductions, welcome!
31.08.22	PhD students present! Ongoing projects and opportunities iSAT: Jim, John, Maggie, Ananya, Zoe AIDA: Elizabeth, Rehan, Sijia
07.09.22	PhD students continue to present! 20 minutes per project KAIROS: Susan, Reece AmericasNLP: Katharina, Alexis, Abteen FOLTA: Alexis, Bhargav, Enora, Michael StoryGenerations: Katharina, Maria, Trevor
14.09.22	And more presentations from our fabulous students and colleagues! Assisted interviewing: DJ, Abe THYME: UMR: Martha and Jim
21.09.22	lunch at the Taj
28.09.22	James Pustejovsky Dense Paraphrasing for Textual Enrichment: Question Answering and Inference Abstract: Much of the current computational work on inference in NLP can be associated with one of two techniques: the first focuses on a specific notion of text-based question answering (QA), using large pre-trained language models (LLMs). To examine specific linguistic properties present in the model, « probing tasks » (diagnostic classifiers) have been developed to test capabilities that the LLM demonstrates on interpretable semantic inferencing tasks, such as age and object comparisons, hypernym conjunction, antonym negation, and others. The second is Knowledge Graph-based inference and QA, where triples are mined from Wikipedia, ConceptNet, WikiData, and other non-corpus resources, and then used for answering questions involving multiple components of the KG (multi-hop QA). While quite impressive with benchmarked metrics in QA, both techniques are completely confused by (a) syntactically missing semantic content, and (b) the semantics accompanying the consequences of events and actions in narratives. In this talk, I discuss a model we have developed to enrich the surface form of texts, using type-based semantic operations to « textually expose » the deeper meaning of the corpus that was used to make the original embeddings in the language model. This model, Dense Paraphrasing, is a linguistically-motivated, textual enrichment strategy, that textualizes the compositional operations inherent in a semantic model, such as Generative Lexicon Theory or CCG. This involves broadly three kinds of interpretive processes: (i) recognizing the diverse variability in linguistic forms that can be associated with the same underlying semantic representation (paraphrases); (ii) identifying semantic factors or variables that accompany or are presupposed by the lexical semantics of the words present in the text, through dropped, hidden or shadow arguments; and (iii) interpreting or computing the dynamic consequences of actions and events in the text. After performing these textual enrichment algorithms, we fine-tune the LLM which allows more robust inference and QA task performance. James Pustejovsky, Professor TJX Feldberg Chair in Computer Science Department of Computer Science Chair of CL MS Program Chair of Linguistics Program
05.10.22	Martha, COLING keynote // Daniel poster presentation dry run
12.10.22	COLING / paper review
19.10.22	CLASIC Open House, 11am-1pm This is largely an informational event for students interested in the CLASIC (Computational Linguistics, Analytics, Search, and InformatiCs) Master's program and/or the new LING to CLASIC BAM program. The event will include short talks from graduates of the CLASIC program, and then lunch. Please register if you're interested - until 5pm Monday October 17th.
21.10.22	FRIDAY Carolyn Rose, Carnegie Mellon(ICS/iSAT event) Special time and place: 11am-12:15pm MT, Muenzinger D430 / zoom Title: A Layered Model of Learning during Collaborative Software Development: Programs, Programming, and Programmers Collaborative software development, whether synchronous or asynchronous, is a creative, integrative process in which something new comes into being through the joint engagement, something new that did not fully exist in the mind of any one person prior to the engagement. One can view this engagement from a macro-level perspective, focusing on large scale development efforts of 100 or more developers, organized into sub-teams, producing collections complex software products like Mozilla. Past work in the area of software engineering has explored the symbiosis between the management structure of a software team and the module structure of the resulting software. In this talk, we focus instead on small scale software teams of between 2 and 5 developers, working on smaller-scale efforts of between one hour and 9 months, through more fine grained analysis of collaborative processes and collaborative products. In this more tightly coupled engagement within small groups, we see again a symbiosis between people, processes, and products. This talk bridges between the field of Computer-Supported Collaborative Learning and the study of software teams in the field of Software Engineering by investigating the inner-workings of small scale collaborative software development. Building on over a decade of AI-enabled collaborative learning experiences in the classroom and online, in this talk we report our work in progress beginning with classroom studies in large online software courses with substantial teamwork components. In our classroom work, we have adapted an industry standard team practice referred to as Mob Programming into a paradigm called Online Mob Programming (OMP) for the purpose of encouraging teams to reflect on concepts and share work in the midst of their project experience. At the core of this work are process mining technologies that enable real time monitoring and just-in-time support for learning during productive work. Recent work on deep-learning approaches to program understanding bridge between investigations of processes and products.
26.10.22	No meeting -- go to Barbara's talk on Friday, and Nathan's on Monday!
28.10.22	FRIDAY Barbara diEugenio (ICS talk, noon) Special time and place: 12-1:30pm MT, Muenzinger D430 / zoom Title: Knowledge Co-Construction and Initiative in Peer Learning for introductory Computer Science Peer learning has often been shown to be an effective mode of learning for all participants; and knowledge co-construction (KCC), when participants work together to build knowledge, has been shown to correlate with learning in peer interactions. However, KCC is hard to identify and/or support computationally. We conducted an extensive analysis of a corpus of peer-learning interactions in introductory Computer Science: we found a strong relationship between KCC and the linguistic notion of initiative shift, and moderate correlations between initiative shifts and learning. The results of this analysis were incorporated into KSC-PaL, an artificial agent that can collaborate with a human student via natural-language dialog and actions within a graphical workspace. Evaluations of KSC-PaL showed that the agent was able to encourage shifts in initiative in order to promote learning and that students learned using the agent. This work (joint with Cindy Howard, now at Lewis University), was part of two larger projects that studied tutoring dialogues and peer learning interactions for introductory Computer Science, and that resulted in two Intelligent Tutoring Systems, iList and Chiqat-Tutor. Barbara Di Eugenio, PhD, Professor and Director of Graduate Studies, Department of Computer Science, University of Illinois, Chicago
31.10.22	MONDAY Nathan Schneider (Ling Circle Talk, 4pm) Special time and place: 4pm, UMC 247 / zoom (passcode: 795679) Title: The Ins and Outs of Preposition Semantics: Challenges in Comprehensive Corpus Annotation and Automatic Disambiguation In most linguistic meaning representations that are used in NLP, prepositions fly under the radar. I will argue that they should instead be put front and center given their crucial status as linkers of meaning—whether for spatial and temporal relations, for predicate-driven roles, or in special constructions. To that end, we have sought to characterize and disambiguate semantic functions expressed by prepositions and possessives in English (Schneider et al., ACL 2018), and similar markers in other languages (Mandarin Chinese, Korean, Hindi, and German). This approach can be broadened to other constructions and integrated in full-sentence lexical semantic tagging as well as graph-structured meaning representation parsing. Other investigations include crowdsourced annotation, contextualized preposition embeddings, and preposition use in fluent nonnative English. Nathan Schneider, Associate Professor, Depts. of Computer Science and Linguistics, Georgetown University
02.11.22	* No meeting - UMRs team at Brandeis *
09.11.22	Practice talks Ananya Ganesh: "CHIA: CHoosing Instances to Annotate for Machine Translation" (accepted to Findings of EMNLP; practicing for the video recording) Abteen Ebrahimi: "Second AmericasNLP Competition on Speech-to-Text Translation for Indigenous Languages of the Americas" (NeurIPS competition; practicing for the in-person presentation of the competition)
16.11.22	Maggie Perkoff, prelim Title: Who said it best? A Thematic Analysis of Open Domain Response Generation Systems Open domain response generation is a rapidly increasing field of natural language processing research. This type of system can be embedded in social chatbots, teaching assistants, and even therapy sessions. The open domain space is defined by the absence of a specific task that the user is trying to achieve by engaging with the conversational agent. A variety of methods have been proposed to improve the capability of these models including knowledge grounded systems, persona embeddings, and transformer models trained on vast datasets. Some of these systems use automated metrics for evaluation alongside human annotators for response quality - but there is no standard assessment for what makes an open domain dialogue system 'better' than any other. This paper seeks to identify broad categories of response generation systems and analyze them based on different themes in open domain conversation: engagement, consistency, correctness, personality, and toxicity.
23.11.22	* No meeting - fall break *
30.11.22	Postponed to Spring 2023: HuggingFace demo - Trevor Ward
07.12.22	Coffee and pastries

@@ Line 1: / Line 1: @@
+'''Location:''' Hybrid - Buchanan 126, and the zoom link below
+'''Time:''' Wednesdays at 10:30am, Mountain Time
+'''Zoom link:''' https://cuboulder.zoom.us/j/97014876908
 {| class="wikitable" border="1"
 |+
@@ Line 4: / Line 10: @@
 ! Title
 |- style="border-top: 2px solid DarkGray;"
-| 9.2.20 || Planning
 |- style="border-top: 2px solid DarkGray;"
-| 9.9.20 ||  More planning
+| 24.08.22 || '''Planning, introductions, welcome!'''
 |- style="border-top: 2px solid DarkGray;"
-| 9.16.20 || Vivek Srikumar -   Title: Fads, Fallacies and Fantasies in the Name of Machine Learning
+| 31.08.22 || PhD students present! Ongoing projects and opportunities
+* iSAT: Jim, John, Maggie, Ananya, Zoe
+* AIDA: Elizabeth, Rehan, Sijia
+|- style="border-top: 2px solid DarkGray;"
+| 07.09.22 || PhD students continue to present!
+minutes per project
+* KAIROS: Susan, Reece
+* AmericasNLP: Katharina, Alexis, Abteen
+* FOLTA: Alexis, Bhargav, Enora, Michael
+* StoryGenerations: Katharina, Maria, Trevor
-Abstract: The pervasiveness of machine learning, and artificial intelligence powered by it, is clear from even a cursory overview of the last several years of academic literature and mainstream technology reporting.  The goal of this talk is to provoke thought and discussions about the future of the field. To this end, I will talk about how, as a field, applied machine learning may be starting to bind itself into an intellectual monoculture. In particular, I will describe specific blinders that we may find hard to shake off: (a) the obsession with ranking and leaderboarding, (b) the assumption that purely data-driven computing is always the right answer, and (c) the excessive focus on clean toy problems in lieu of working with real data. Along the way, we will see several examples of questions that we may be able to think about if we cast aside these blinders.
-Bio: Vivek Srikumar is associate professor in the School of Computing at the University of Utah.  His research lies in the areas of natural learning processing and machine learning and has primarily been driven by questions arising from the need to reason about textual data with limited explicit supervision and to scale NLP to large problems. His work has been published in various AI, NLP and machine learning venues and has been recognized by paper awards from EMNLP and CoNLL. His work has been supported by awards from NSF, BSF and NIH, and also from several companies.. He obtained his Ph.D. from the University of Illinois at Urbana-Champaign in 2013 and was a post-doctoral scholar at Stanford University.
+|- style="border-top: 2px solid DarkGray;"
+| 14.09.22 || And more presentations from our fabulous students and colleagues!
-Recording of Vivek's presentation: https://drive.google.com/file/d/1KKbU46LJbCFIgSF-CcpdgHMq4nZmyNt2/view?usp=sharing
+* Assisted interviewing: DJ, Abe
+* THYME:
+* UMR: Martha and Jim
 |- style="border-top: 2px solid DarkGray;"
-| 9.23.20 || 2 papers:
+| 21.09.22 || lunch at the Taj
-Paper 1 - Jonas Pfeiffer (grad student, TU Darmstadt), MAD-X: An Adapter-based Framework for Multi-task Cross-lingual Transfer,
+|- style="border-top: 2px solid DarkGray;"
-Jonas Pfeiffer, Ivan Vulić, Iryna Gurevych, Sebastian Ruder, EMNLP 2020. https://arxiv.org/pdf/2005.00052.pdf
+| 28.09.22 || James Pustejovsky
-Abstract: The main goal behind state-of-the-art pretrained multilingual models such as multilingual BERT and XLM-R is enabling and bootstrapping NLP applications in low-resource languages through zero-shot or few-shot cross-lingual transfer. However, due to limited model capacity, their transfer performance is the weakest exactly on such low-resource languages and languages unseen during pretraining. We propose MAD-X, an adapter-based framework that enables high portability and parameter-efficient transfer to arbitrary tasks and languages by learning modular language and task representations. In addition, we introduce a novel invertible adapter architecture and a strong baseline method for adapting a pretrained multilingual model to a new language. MAD-X outperforms the state of the art in cross-lingual transfer across a representative set of typologically diverse languages on named entity recognition and achieves competitive results on question answering.
+Dense Paraphrasing for Textual Enrichment: Question Answering and Inference
+Abstract: Much of the current computational work on inference in NLP can be associated with one of two techniques: the first focuses on a specific notion of text-based question answering (QA), using large pre-trained language models (LLMs). To examine specific linguistic properties present in the model, « probing tasks » (diagnostic classifiers) have been developed to test capabilities that the LLM demonstrates on interpretable semantic inferencing tasks, such as age and object comparisons, hypernym conjunction, antonym negation, and others. The second is Knowledge Graph-based inference and QA, where triples are mined from Wikipedia, ConceptNet, WikiData, and other non-corpus resources, and then used for answering questions involving multiple components of the KG (multi-hop QA). While quite impressive with benchmarked metrics in QA, both techniques are completely confused by (a) syntactically missing semantic content, and (b) the semantics accompanying the consequences of events and actions in narratives.
+In this talk, I discuss a model we have developed to enrich the surface form of texts, using type-based semantic operations to « textually expose » the deeper meaning of the corpus that was used to make the original embeddings in the language model. This model, Dense Paraphrasing, is a linguistically-motivated, textual enrichment strategy, that textualizes the compositional operations inherent in a semantic model, such as Generative Lexicon Theory or CCG. This involves broadly three kinds of interpretive processes: (i) recognizing the diverse variability in linguistic forms that can be associated with the same underlying semantic representation (paraphrases); (ii) identifying semantic factors or variables that accompany or are presupposed by the lexical semantics of the words present in the text, through dropped, hidden or shadow arguments; and (iii) interpreting or computing the dynamic consequences of actions and events in the text. After performing these textual enrichment algorithms, we fine-tune the LLM which allows more robust inference and QA task performance.
+James Pustejovsky, Professor
+TJX Feldberg Chair in Computer Science
+Department of Computer Science
+Chair of CL MS Program
+Chair of Linguistics Program
-Paper 2 - Extending Multilingual BERT to Low-Resource Languages https://arxiv.org/abs/2004.13640
 |- style="border-top: 2px solid DarkGray;"
-| 9.30.20 || Jon's Prelim
+| 05.10.22 || Martha, COLING keynote // Daniel poster presentation dry run
 |- style="border-top: 2px solid DarkGray;"
-| 10.7.20 || Peter Foltz "NLP for Team Communication Analysis"
+| 12.10.22 || COLING / paper review
-|- style="border-top: 2px solid DarkGray;
+|- style="border-top: 2px solid DarkGray;"
-| 10.14.20 || 2 papers:
+| 19.10.22 || CLASIC Open House, 11am-1pm
-Paper 1 - Tao Li (Utah CS PhD student) - Structured Tuning for Semantic Role Labeling, ACL 2020
-Authors: Tao Li, Parth Anand Jawale, Martha Palmer, Vivek Srikumar, https://www.aclweb.org/anthology/2020.acl-main.744.pdf
-Abstract: Recent neural network-driven semantic role labeling (SRL) systems have shown impressive improvements in F1 scores. These improvements are due to expressive input representations, which, at least at the surface, are orthogonal to knowledge-rich constrained decoding mechanisms that helped linear SRL models. Introducing the benefits of structure to inform neural models presents a methodological challenge. In this paper, we present a structured tuning framework to improve models using softened constraints only at training time. Our framework leverages the expressiveness of neural networks and provides supervision with structured loss components. We start with a strong baseline (RoBERTa) to validate the impact of our approach, and show that our framework outperforms the baseline by learning to comply with declarative constraints. Additionally, our experiments with smaller training sizes show that we can achieve consistent improvements under low-resource scenarios.
+This is largely an informational event for students interested in the CLASIC (Computational Linguistics, Analytics, Search, and InformatiCs) Master's program and/or the new LING to CLASIC BAM program. The event will include short talks from graduates of the CLASIC program, and then lunch. Please [https://forms.gle/PSNihAGyX7y8aVGV8 register] if you're interested - until 5pm Monday October 17th.
+|- style="border-top: 2px solid DarkGray;"
+| 21.10.22 || '''FRIDAY''' Carolyn Rose, Carnegie Mellon(ICS/iSAT event)
-Paper 2 - Stephane Aroca-ouellette EMNLP 2020 practice talk on exploring auxiliary tasks for BERT
+'''Special time and place:''' 11am-12:15pm MT, Muenzinger D430 / [https://cuboulder.zoom.us/j/97658438049 zoom]
+'''Title:''' A Layered Model of Learning during Collaborative Software Development: Programs, Programming, and Programmers
+Collaborative software development, whether synchronous or asynchronous, is a creative, integrative process in which something new comes into being through the joint engagement, something new that did not fully exist in the mind of any one person prior to the engagement.  One can view this engagement from a macro-level perspective, focusing on large scale development efforts of 100 or more developers, organized into sub-teams, producing collections complex software products like Mozilla.  Past work in the area of software engineering has explored the symbiosis between the management structure of a software team and the module structure of the resulting software.  In this talk, we focus instead on small scale software teams of between 2 and 5 developers, working on smaller-scale efforts of between one hour and 9 months, through more fine grained analysis of collaborative processes and collaborative products.  In this more tightly coupled engagement within small groups, we see again a symbiosis between people, processes, and products.  This talk bridges between the field of Computer-Supported Collaborative Learning and the study of software teams in the field of Software Engineering by investigating the inner-workings of small scale collaborative software development.  Building on over a decade of AI-enabled collaborative learning experiences in the classroom and online, in this talk we report our work in progress beginning with classroom studies in large online software courses with substantial teamwork components.  In our classroom work, we have adapted an industry standard team practice referred to as Mob Programming into a paradigm called Online Mob Programming (OMP) for the purpose of encouraging teams to reflect on concepts and share work in the midst of their project experience.  At the core of this work are process mining technologies that enable real time monitoring and just-in-time support for learning during productive work.  Recent work on deep-learning approaches to program understanding bridge between investigations of processes and products.
+|- style="border-top: 2px solid DarkGray;"
+| 26.10.22 || '''No meeting -- go to Barbara's talk on Friday, and Nathan's on Monday!'''
 |- style="border-top: 2px solid DarkGray;"
-| 10.21.20 || Two Brian Keegan PhD students: Arcadia Zhang and Jordan Wirfs-Brock
+| 28.10.22 || '''FRIDAY''' [https://cs.uic.edu/profiles/barbara-di-eugenio/ Barbara diEugenio] (ICS talk, noon)
+'''Special time and place:''' 12-1:30pm MT, Muenzinger D430 / [https://cuboulder.zoom.us/j/97658438049 zoom]
+'''Title:''' Knowledge Co-Construction and Initiative in Peer Learning for introductory Computer Science
+Peer learning has often been shown to be an effective mode of learning for all participants; and knowledge co-construction (KCC), when participants work together to build knowledge, has been shown to correlate with learning in peer interactions. However, KCC is hard to identify and/or support  computationally. We conducted an extensive analysis of  a corpus of peer-learning interactions in introductory Computer Science: we found a strong relationship between KCC and the linguistic notion of initiative shift,  and moderate correlations between initiative shifts and learning. The results of this analysis were incorporated into KSC-PaL, an artificial agent that can collaborate with a human student via natural-language dialog and actions within a graphical workspace. Evaluations of KSC-PaL showed that the agent was able to encourage shifts in initiative in order to promote learning and that students learned using the agent. This work (joint with Cindy Howard, now at Lewis University), was part of  two larger projects that studied tutoring dialogues and peer learning interactions for introductory Computer Science, and that resulted in two Intelligent Tutoring Systems, iList and Chiqat-Tutor.
+Barbara Di Eugenio, PhD,
+Professor and Director of Graduate Studies,
+Department of Computer Science,
+University of Illinois, Chicago
 |- style="border-top: 2px solid DarkGray;"
-| 10.28.20 || Chelsea Proposal
+| 31.10.22 || '''MONDAY''' [https://people.cs.georgetown.edu/nschneid/ Nathan Schneider] (Ling Circle Talk, 4pm)
+'''Special time and place:''' 4pm, UMC 247 / [https://cuboulder.zoom.us/j/94447045287 zoom] (passcode: 795679)
+'''Title:''' The Ins and Outs of Preposition Semantics: Challenges in Comprehensive Corpus Annotation and Automatic Disambiguation
+In most linguistic meaning representations that are used in NLP, prepositions fly under the radar. I will argue that they should instead be put front and center given their crucial status as linkers of meaning—whether for spatial and temporal relations, for predicate-driven roles, or in special constructions. To that end, we have sought to characterize and disambiguate semantic functions expressed by prepositions and possessives in English (Schneider et al., ACL 2018), and similar markers in other languages (Mandarin Chinese, Korean, Hindi, and German). This approach can be broadened to other constructions and integrated in full-sentence lexical semantic tagging as well as graph-structured meaning representation parsing. Other investigations include crowdsourced annotation, contextualized preposition embeddings, and preposition use in fluent nonnative English.
+Nathan Schneider,
+Associate Professor,
+Depts. of Computer Science and Linguistics,
+Georgetown University
 |- style="border-top: 2px solid DarkGray;"
-| 11.4.20 ||  Skatje Proposal?
+| 02.11.22 || *** No meeting - UMRs team at Brandeis ***
 |- style="border-top: 2px solid DarkGray;"
-| 11.11.20 || Rehan Proposal
+| 09.11.22 || Practice talks
+* Ananya Ganesh: "CHIA: CHoosing Instances to Annotate for Machine Translation" (accepted to Findings of EMNLP; practicing for the video recording)
+* Abteen Ebrahimi: "Second AmericasNLP Competition on Speech-to-Text Translation for Indigenous Languages of the Americas" (NeurIPS competition; practicing for the in-person presentation of the competition)
 |- style="border-top: 2px solid DarkGray;"
-| 11.18.20 || NAACL submission workshop
+| 16.11.22 || Maggie Perkoff, prelim
+'''Title:''' Who said it best? A Thematic Analysis of Open Domain Response Generation Systems
+Open domain response generation is a rapidly increasing field of natural language processing research.  This type of system can be embedded in social chatbots, teaching assistants, and even therapy sessions.  The open domain space is defined by the absence of a specific task that the user is trying to achieve by engaging with the conversational agent.  A variety of methods have been proposed to improve the capability of these models including knowledge grounded systems, persona embeddings, and transformer models trained on vast datasets.  Some of these systems use automated metrics for evaluation alongside human annotators for response quality - but there is no standard assessment for what makes an open domain dialogue system 'better' than any other.  This paper seeks to identify broad categories of response generation systems and analyze them based on different themes in open domain conversation: engagement, consistency, correctness, personality, and toxicity.
 |- style="border-top: 2px solid DarkGray;"
-| 11.25.20 || Fall Break
+| 23.11.22 || *** No meeting - fall break ***
 |- style="border-top: 2px solid DarkGray;"
-| 12.2.20 || Tentative Abhidip's proposal
+| 30.11.22|| '''Postponed to Spring 2023:''' HuggingFace demo - Trevor Ward
 |- style="border-top: 2px solid DarkGray;"
-| 12.9.20 || Tentative Vivian proposal
+| 07.12.22 ||  Coffee and pastries
 |- style="border-top: 2px solid DarkGray;"
+|}
-|}
 =Past Schedules=
+* [[Fall 2022 Schedule]]
+* [[Spring 2022 Schedule]]
+* [[Fall 2021 Schedule]]
+* [[Spring 2021 Schedule]]
+* [[Fall 2020 Schedule]]
 * [[Spring 2020 Schedule]]
 * [[Fall 2019 Schedule]]

Difference between revisions of "Fall 2022 Schedule"

Latest revision as of 21:11, 17 January 2023

Past Schedules

Navigation menu

Page actions

Page actions

Personal tools

Navigation

Search

Tools