Curriculum Vitae

Statement


I have a strong multidisciplinary background and an interest for exploring ideas that crosses traditional academic boundaries. My primary research is in the application of natural language processing for computationally modeling and generating narratives. Although centered on core areas of Computer Science, my research and experience touches upon the arts, humanities and social sciences. Bringing these disciplines together is necessary to develop computational systems and agents that have a deep understanding of human activities, social behavior and commonsense beliefs. This knowledge is critical for the future of successful human computer interaction.


Education

2005-2010
Los Angeles, CA
Ph.D., Computer Science
Advisor: Andrew S. Gordon
Viterbi School of Engineering
University of Southern California
2003-2005
Los Angeles, CA
M.A., Computational Linguistics
College of Letters, Arts and Sciences
University of Southern California
1994-1999
Los Angeles
B.A., Fine Art
School of the Arts and Architecture
University of California Los Angeles
1994-1999
Los Angeles
B.A. Philosophy
College of Letters, Arts and Sciences
University of California Los Angeles

Research Experience


2017-Present
Independent
Independent Research
Various
I have been working on a number of research and development projects. I am investigating semi-supervised techniques for word sense disambiguation. Using discourse analysis to analyze threaded discussions to highlight more informative posts, provide summaries of large discussions, and develop reputations systems that reward a diversity of thoughts rather than groupthink. Finally, I am also investigating techniques in sports analytics with an emphasis on Fantasy team management and applications.
Aug 2015-2017
USC/ICT
Los Angeles, CA
Research Scientist
Interpretation of Personal Narratives
My research at the Institute for Creative Technologies focused on using natural language processing to analyze narrative discourse. One aspect of this research is to develop techniques for rapidly authoring interactive stores and driving these simulations based on models learned from real-world narratives. We also investigated how narrative discourse structure of personal stories affects attitude change in individuals regardless of the content of the message.
Sep 2013-May 2015
UC Santa Cruz
Santa Cruz, CA
Postdoctoral Scholar
Opinion Sharing Dialog in Social Media
(PI: Marilyn Walker)
Most automated analysis of discourse related to persuasion and argumentation focuses on the propositional content of the discussion. In contrast to this work, we are examining the role subjective language plays in they way discussion evolve in online dialogs. We are exploring several types of subjective language such as sarcasm, stance and opinion versus fact. We are investigating the nature of how people perceive this language and the reliability of judgments on these kinds of tasks. Guided by our users studies we are using machine learning to automatically identify this language in posts for further large-scale analyses.
Apr 2012-Jun 2013
UC Santa Cruz
Santa Cruz, CA
Postdoctoral Scholar
IMMERSE
(PI: Michael Mateas)
Most automated analysis of discourse related to persuasion and argumentation focuses on the propositional content of the discussion. In contrast to this work, we are examining the role subjective language plays in they way discussion evolve in online dialogs. We are exploring several types of subjective language such as sarcasm, stance and opinion versus fact. We are investigating the nature of how people perceive this language and the reliability of judgments on these kinds of tasks. Guided by our users studies we are using machine learning to automatically identify this language in posts for further large-scale analyses.
Jun 2011-Apr 2012
Jun 2013-Sep 2013
UC Santa Cruz
Santa Cruz, CA
Postdoctoral Scholar
SIREN
(PI: Arnav Jhala)
The SIREN project is a large European Union grant with partners in over 5 countries that are developing educational games to help children improve the way they handle conflicts. My primary focus on this project is to leverage current psychological theories of interpersonal conflicts to create natural and dynamic character models by blending large-scale data driven approaches with hand authored methods.
Apr 2010-Apr 2011
WDI R&D
Glendale, CA
Research Lab Associate
Interactive Storytelling
At Disney I developed technologies for supporting several types of computer aided interactive storytelling applications. In particular, researching the latest classical, probabilistic and case-based planning techniques to be used for automated show control, visualization strategies for improving authoring efficacy and efficiency for interactive stories, and large scale ontology and semantic relationship extraction from the web to improve the scope of knowledge our system could reason about.
2004-2009
USC/ICT
Los Angeles, CA
Research Assistant
Thesis Topic: Say Anything
I developed an open-domain, interactive narrative generation system, where a person and computer take turns writing sentences of an emerging fictional narrative. A combination of information retrieval, machine learning and natural language processing enable the system to return sentences that produce stories nearly indistinguishable from weblog stories written entirely by humans. Although primarily conceived in the entertainment domain, it also has several other promising applications, such as a tool for children’s education, an automated improvisational exercise, a writer’s aid or general inference mechanism.
Masters Project
I developed a specialized search engine for retrieving first person narrative stories. As part of the project a story classification program was developed to identify narrative sentences using a Support Vector Machine classifier and a Gaussian filter. A custom retrieval engine was used to find relevant stories from a large collection gathered using Google Blog Search and the story classifier. To find relevant stories the retrieval engine used a vector space model with lexical, part-of-speech and semantic role label features extracted from a custom SRL system.
Natural Language Processing
In addition to the natural language processing that is implicitly necessary to support my thesis project I have also explicitly explored several core NLP topics. These include techniques for domain adaptation in semantic role labeling, applying unsupervised linguistic parsing techniques to other structured data such as music and I have also explored syntactic language models for automatic speech recognition.
Common Sense Reasoning
As part-of a research project in our lab to create several resources around a large- scale ontology of common-sense psychology, I helped organize data collection sessions for gathering natural language expressions capturing concepts in this ontology. The data was then used to author finite state transducers capable of recognizing and tagging these concepts in English text. In addition to collecting natural language expressions for these theories I also tried to validate the theory of Memory, one of Gordon & Hobbs core common-sense theories. The validation was performed by proving several common-sense strategies for Memory storage and retrieval using an automatic theorem solver and a first-order representation of the theory and strategies.
Summer 2006
IBM Watson
Hawthorn, NY
IBM Watson Research Center Internship
Semantic Network Visualization
During this internship I designed a mixed reality visualization system for aiding in the development of real-world large-scale semantic networks used in IBM’s Semantic Engine. The visualization used clustering techniques to isolate semantically similar sub-groups of the network. Each sub-group was arranged independently using a standard 2D graph layout algorithm. These 2D clusters were then mapped onto the 3D surface of a globe using an API for Google Earth. The inherent shape of the earth naturally emphasized relevant information by occluding less similar parts of the network beyond the horizon and the resulting interface made inspecting and traversing graphs of thousands of nodes and links easier through clear visual feedback. The interface could be traversed normally using the standard Google Earth client, but also through the manipulation of a physical globe in the real world, using RFID tags to track the position of the globe.

Teaching Experience

Winter 2015
UC Santa Cruz
Santa Cruz, CA
Lecturer
CMPS 245: Computational Models of Discourse and Dialogue
This is a graduate course discussing current and foundational work in special topics related to computational models of discourse and dialogue. This quarter focused on several deep semantic contrasted with statistical representations of narrative discourse.
https://courses.soe.ucsc.edu/courses/cmps179/Spring13/02
Spring 2014
UC Santa Cruz
Santa Cruz, CA
Co-Instructor (Unofficial)
CMPS 143: Introduction to Natural Language Processing
This course provides an introduction to the theory and practice of natural language processing. The course was primarily run by Marilyn Walker, however I helped build the syllabus, design the assignments and ran about half of the lectures.
http://courses.soe.ucsc.edu/courses/cmps143/Spring14/01/pages/nlpsyllabus
Fall 2013
UC Santa Cruz
Santa Cruz, CA
Lecturer
CMPS 012B: Introduction to Data Structures
A basic introduction to basic data structures and algorithms. Teaches fundamental data structures and algorithms in C and Java. Topics include big O notation, linked lists, stacks, queues, binary search trees, hash tables and sorting. Course material largely provided by previous instructor.
Spring 2013
UC Santa Cruz
Santa Cruz, CA
Lecturer
CMPS 179: UX Techniques for Designing 3D, Animation, and Interaction for the Web
This course explored techniques for interactive data visualization using new vector and 3D graphics capabilities of HTML 5. The course emphasized both technical and creative aspects of interactive visualization through the development of several projects using real world data acquired from eBay’s public API.
https://courses.soe.ucsc.edu/courses/cmps179/Spring13/02
Spring 2013
UC Santa Cruz
Santa Cruz, CA
Co-Instructor (Unofficial)
CMPS 245: Computational Models of Discourse and Dialogue
This was a seminar course that explored several theoretical and computational models of discourse. Each week students read several papers and presented the reading to the rest of the class. In addition to several homework assignments the students prepared a quarter long final project based on a topic of interest. The course was primarily run by Marilyn Walker, however I helped build the syllabus and lead discussions in class.
http://courses.soe.ucsc.edu/courses/cmps245/Spring13/01/pages/computational-models
Fall 2009
USC
Los Angeles, CA
Teaching Assistant
CSCI402: Operating Systems
As a teaching assistant I helped students learn about several core concepts underlying modern operating systems including issues of concurrency, system calls, virtual memory and networking. My primary role was conducted during office hours where students sought help in the topics covered in lecture and problems that arose in their programming assignments. In addition I helped in the administration and grading of the projects and exams.

Awards

2010 Nominated for the ACM Thesis Award
2005-2009 USC Viterbi School of Engineering and Computer Science Doctoral Fellowship

Art Collaborations

2010 With James Coupe, “Today, Too, I Experienced Something I Hope to Understand in a Few Days”
http://jamescoupe.com/?p=778

Organized Events

2012 Organizing chair of 1st Workshop on Human Computation in Digital Entertainment hosted at AIIDE 2012

Patents

2011 Role-play simulation engine (US20130066608 A1) with Asa K. Kalama, Cory J. Rouse and Michael Ilardi