Statement
I have a strong multidisciplinary background and an interest for exploring ideas that crosses traditional academic boundaries. My primary research is in the application of natural language processing for computationally modeling and generating narratives. Although centered on core areas of Computer Science, my research and experience touches upon the arts, humanities and social sciences. Bringing these disciplines together is necessary to develop computational systems and agents that have a deep understanding of human activities, social behavior and commonsense beliefs. This knowledge is critical for the future of successful human computer interaction.
Education
2005-2010 Los Angeles, CA |
Ph.D., Computer Science
Advisor: Andrew S. Gordon Viterbi School of Engineering University of Southern California |
2003-2005 Los Angeles, CA |
M.A., Computational Linguistics
College of Letters, Arts and Sciences University of Southern California |
1994-1999 Los Angeles |
B.A., Fine Art
School of the Arts and Architecture University of California Los Angeles |
1994-1999 Los Angeles |
B.A. Philosophy
College of Letters, Arts and Sciences University of California Los Angeles |
Research Experience
2017-Present Independent |
Independent Research Various I have been working on a number of research and development projects. I am investigating semi-supervised techniques for word sense disambiguation. Using discourse analysis to analyze threaded discussions to highlight more informative posts, provide summaries of large discussions, and develop reputations systems that reward a diversity of thoughts rather than groupthink. Finally, I am also investigating techniques in sports analytics with an emphasis on Fantasy team management and applications.
|
Aug 2015-2017 USC/ICT Los Angeles, CA |
Research Scientist Interpretation of Personal Narratives My research at the Institute for Creative Technologies focused on using natural language processing to analyze narrative discourse. One aspect of this research is to develop techniques for rapidly authoring interactive stores and driving these simulations based on models learned from real-world narratives. We also investigated how narrative discourse structure of personal stories affects attitude change in individuals regardless of the content of the message.
|
Sep 2013-May 2015 UC Santa Cruz Santa Cruz, CA |
Postdoctoral Scholar Opinion Sharing Dialog in Social Media (PI: Marilyn Walker)
Most automated analysis of discourse related to persuasion and argumentation focuses on the propositional content of the discussion. In contrast to this work, we are examining the role subjective language plays in they way discussion evolve in online dialogs. We are exploring several types of subjective language such as sarcasm, stance and opinion versus fact. We are investigating the nature of how people perceive this language and the reliability of judgments on these kinds of tasks. Guided by our users studies we are using machine learning to automatically identify this language in posts for further large-scale analyses. |
Apr 2012-Jun 2013 UC Santa Cruz Santa Cruz, CA |
Postdoctoral Scholar IMMERSE (PI: Michael Mateas)
Most automated analysis of discourse related to persuasion and argumentation focuses on the propositional content of the discussion. In contrast to this work, we are examining the role subjective language plays in they way discussion evolve in online dialogs. We are exploring several types of subjective language such as sarcasm, stance and opinion versus fact. We are investigating the nature of how people perceive this language and the reliability of judgments on these kinds of tasks. Guided by our users studies we are using machine learning to automatically identify this language in posts for further large-scale analyses. |
Jun 2011-Apr 2012 Jun 2013-Sep 2013 UC Santa Cruz Santa Cruz, CA |
Postdoctoral Scholar SIREN (PI: Arnav Jhala)
The SIREN project is a large European Union grant with partners in over 5 countries that are developing educational games to help children improve the way they handle conflicts. My primary focus on this project is to leverage current psychological theories of interpersonal conflicts to create natural and dynamic character models by blending large-scale data driven approaches with hand authored methods. |
Apr 2010-Apr 2011 WDI R&D Glendale, CA |
Research Lab Associate Interactive Storytelling At Disney I developed technologies for supporting several types of computer aided interactive storytelling applications. In particular, researching the latest classical, probabilistic and case-based planning techniques to be used for automated show control, visualization strategies for improving authoring efficacy and efficiency for interactive stories, and large scale ontology and semantic relationship extraction from the web to improve the scope of knowledge our system could reason about.
|
2004-2009 USC/ICT Los Angeles, CA |
Research Assistant Thesis Topic: Say Anything I developed an open-domain, interactive narrative generation system, where a person and computer take turns writing sentences of an emerging fictional narrative. A combination of information retrieval, machine learning and natural language processing enable the system to return sentences that produce stories nearly indistinguishable from weblog stories written entirely by humans. Although primarily conceived in the entertainment domain, it also has several other promising applications, such as a tool for children’s education, an automated improvisational exercise, a writer’s aid or general inference mechanism.
Masters Project
I developed a specialized search engine for retrieving first person narrative stories. As part of the project a story classification program was developed to identify narrative sentences using a Support Vector Machine classifier and a Gaussian filter. A custom retrieval engine was used to find relevant stories from a large collection gathered using Google Blog Search and the story classifier. To find relevant stories the retrieval engine used a vector space model with lexical, part-of-speech and semantic role label features extracted from a custom SRL system.
Natural Language Processing
In addition to the natural language processing that is implicitly necessary to support my thesis project I have also explicitly explored several core NLP topics. These include techniques for domain adaptation in semantic role labeling, applying unsupervised linguistic parsing techniques to other structured data such as music and I have also explored syntactic language models for automatic speech recognition.
Common Sense Reasoning
As part-of a research project in our lab to create several resources around a large- scale ontology of common-sense psychology, I helped organize data collection sessions for gathering natural language expressions capturing concepts in this ontology. The data was then used to author finite state transducers capable of recognizing and tagging these concepts in English text. In addition to collecting natural language expressions for these theories I also tried to validate the theory of Memory, one of Gordon & Hobbs core common-sense theories. The validation was performed by proving several common-sense strategies for Memory storage and retrieval using an automatic theorem solver and a first-order representation of the theory and strategies.
|
Summer 2006 IBM Watson Hawthorn, NY |
IBM Watson Research Center Internship Semantic Network Visualization During this internship I designed a mixed reality visualization system for aiding in the development of real-world large-scale semantic networks used in IBM’s Semantic Engine. The visualization used clustering techniques to isolate semantically similar sub-groups of the network. Each sub-group was arranged independently using a standard 2D graph layout algorithm. These 2D clusters were then mapped onto the 3D surface of a globe using an API for Google Earth. The inherent shape of the earth naturally emphasized relevant information by occluding less similar parts of the network beyond the horizon and the resulting interface made inspecting and traversing graphs of thousands of nodes and links easier through clear visual feedback. The interface could be traversed normally using the standard Google Earth client, but also through the manipulation of a physical globe in the real world, using RFID tags to track the position of the globe.
|
Teaching Experience
Winter 2015 UC Santa Cruz Santa Cruz, CA |
Lecturer CMPS 245: Computational Models of Discourse and Dialogue This is a graduate course discussing current and foundational work in special topics related to computational models of discourse and dialogue. This quarter focused on several deep semantic contrasted with statistical representations of narrative discourse.
https://courses.soe.ucsc.edu/courses/cmps179/Spring13/02 |
Spring 2014 UC Santa Cruz Santa Cruz, CA |
Co-Instructor (Unofficial) CMPS 143: Introduction to Natural Language Processing This course provides an introduction to the theory and practice of natural language processing. The course was primarily run by Marilyn Walker, however I helped build the syllabus, design the assignments and ran about half of the lectures.
http://courses.soe.ucsc.edu/courses/cmps143/Spring14/01/pages/nlpsyllabus |
Fall 2013 UC Santa Cruz Santa Cruz, CA |
Lecturer CMPS 012B: Introduction to Data Structures A basic introduction to basic data structures and algorithms. Teaches fundamental data structures and algorithms in C and Java. Topics include big O notation, linked lists, stacks, queues, binary search trees, hash tables and sorting. Course material largely provided by previous instructor.
|
Spring 2013 UC Santa Cruz Santa Cruz, CA |
Lecturer CMPS 179: UX Techniques for Designing 3D, Animation, and Interaction for the Web This course explored techniques for interactive data visualization using new vector and 3D graphics capabilities of HTML 5. The course emphasized both technical and creative aspects of interactive visualization through the development of several projects using real world data acquired from eBay’s public API.
https://courses.soe.ucsc.edu/courses/cmps179/Spring13/02 |
Spring 2013 UC Santa Cruz Santa Cruz, CA |
Co-Instructor (Unofficial) CMPS 245: Computational Models of Discourse and Dialogue This was a seminar course that explored several theoretical and computational models of discourse. Each week students read several papers and presented the reading to the rest of the class. In addition to several homework assignments the students prepared a quarter long final project based on a topic of interest. The course was primarily run by Marilyn Walker, however I helped build the syllabus and lead discussions in class.
http://courses.soe.ucsc.edu/courses/cmps245/Spring13/01/pages/computational-models |
Fall 2009 USC Los Angeles, CA |
Teaching Assistant CSCI402: Operating Systems As a teaching assistant I helped students learn about several core concepts underlying modern operating systems including issues of concurrency, system calls, virtual memory and networking. My primary role was conducted during office hours where students sought help in the topics covered in lecture and problems that arose in their programming assignments. In addition I helped in the administration and grading of the projects and exams.
|
Awards
2010 | Nominated for the ACM Thesis Award |
2005-2009 | USC Viterbi School of Engineering and Computer Science Doctoral Fellowship |
Art Collaborations
2010 | With James Coupe, “Today, Too, I Experienced Something I Hope to Understand in a Few Days” http://jamescoupe.com/?p=778 |
Organized Events
2012 | Organizing chair of 1st Workshop on Human Computation in Digital Entertainment hosted at AIIDE 2012 |
Patents
2011 | Role-play simulation engine (US20130066608 A1) with Asa K. Kalama, Cory J. Rouse and Michael Ilardi |