Tag der Computerlinguistik
This is a page for keeping track of the organization of this event. For the public page go to Tag der Offenen Tür.
The Day of Computational Linguistics will be held on Saturday, June 21, 2008 and will inform students from nearby high schools and universities about our course program. After an introduction to computational linguistics in general and to Tübingen's ISCL in particular, attendees will be free to gather information from several info sections, each devoted to one particular facet of CL. The event is currently being organized by the Fachschaft members; if you would like to join the preparations, you are very welcome to do so.
- 1 Date and Place
- 2 Schedule
- 3 Poster
- 4 Program
- 5 Info Sections
- 6 Food and Drinks
- 7 Fachschaft Workspace
Date and Place
The Open Door Day will take place on Saturday, June 21, 2008 in the Seminar für Sprachwissenschaft (SfS), Wilhelmstraße 19, Tübingen.
- Rooms: we have the computer pool and rooms 1.13 and 1.01. (The lecture halls are already booked by others.)
Schedule
(as of June 19th)
- 10:00 Visitors arrive, gathering and talking
(The rest of this is slowly being turned into German, except for the talks that will be in English, because the official schedule on the official page and the info sheets should be in German, right?)
- 10:20 - 11:00 Prof. Dr. Erhard Hinrichs: Willkommensrede / Einführung in das ISCL-Programm
- 11:00 - 11:30 1.13: Niels Ott: Wie kommen die Wörter ins Wörterbuch? Ein kurzer Einblick in computerlinguistische Werkzeuge, die für das Aufspüren von Wortbedeutungen in der Lexikographie verwendet werden.
- 11:30 - 12:00 1.13: Caroline Arnold, Johannes Dellert: Keine Angst vor Mathe
- 12:00 - 13:00 Computerpool: Marie Hinrichs: Programmierexperimente
- 12:00 - 14:00 1.10: Software-Bazar
- 13:00 - 14:00 Mittagessen / Campusbesichtigung / Software-Bazar
- 14:00 - 15:00 1.13: Simone Paolo Ponzetto, Katja Filippova (EML-Research): The Natural Language Processing Group at EML-Research
- 15:00 - 15:30 1.13: Prof. Dr. Erhard Hinrichs: Das ISCL-Programm (2. Inforede)
- 15:00 - 16:00 1.10: Software-Bazar
- 15:30 - 16:00 1.13: Magdalena Leshtanska: NLP-aided Sentiment Detection
- 16:00 - 16:30 1.13: Anas Elghafari: How come that stupid computers can do smart things? An Introduction to Algorithms.
- 16:30 - 17:00 1.13: Anne Brock: Vom Bild zum Buchstaben. Optical Character Recognition.
- 17:00 Visitors leave
Poster
Here are both a PNG version of the poster and the original SVG made with Inkscape. Please use the SVG if you are going to make any changes to the poster, and the PNG if you only want to look at it.
Program
Talks by the Faculty
- Prof. Dr. Erhard Hinrichs
- Marie Hinrichs
- Sam Featherson - no confirmation yet
- Kilian: LaCrIMoSA
- One of Nomi, Tanya, Plamena, and Anas: Passivator (laptop needed)
Talks by Students (?)
- Magdalena about NLP-aided sentiment detection
- Niels about corpora and lexicography
- Caroline about introductory maths
- Nomi? Anas said that you said that you might do something - what and how much?
Don't have time, but would talk if nobody else is found:
- Aleks about his internship (he promised something "tactile" - what exactly is the topic, again?) -> maybe better make this a software showpiece
- Laura about her internship (taxonomy from Wikipedia, at EML - maybe not a good idea if the EML people tell the same stuff)
- Anne: about her internship (stemming and OCR)
All the people listed under the various info point sections might be requested to change their contributions to short talks, too.
Talk by EML
- Place: Not known yet
Info Sections
Each section will give a short intro and is to be manned by two of us. Please volunteer.
Volunteers: Anonymous, Anonymous
Presentation of intriguing examples, most likely from German, since most attendees are going to be German. Ideas include:
- Collection of marked sentences in Sternefeld 2006 - initiate discussion about their grammaticality (e.g. "weil es wird aufhören können zu regnen" vs. "weil es hätte aufhören müssen zu regnen", "den Kuchen bäckt die Mutter und isst der Franz" vs. "den Kuchen bäckt die Mutter und isst der Franz Kaugummi")
- Presenting ambiguities in languages
- Show how different languages can be (there is an excellent example of Chinese weirdness here). We should only mention languages that people can actually learn in Tübingen. Good candidates for weirdness are certainly Old Irish and Nahuatl. We could actually present one language from every major typological category (e.g. Turkish for agglutinative, Icelandic or Old Irish for inflectional, Chinese for isolating and Nahuatl for (moderately) polysynthetic).
On the basis of the examples, we can try to justify bracketing patterns and tree structures and present them.
Volunteers: Johannes, Caroline?
The station to convince the mathematically-minded of our program.
At this station, people will be able to play around with a few mathematical concepts and tools that we use every day. It is somewhat hard to assess how much mathematical background people will have, so we should be prepared to explain everything from scratch. Offering a broad overview rather than a few little gems might help to avoid problems if some parts are less understandable than expected, and it also reduces the risk of boring the audience.
I know that I am probably proposing way too much here. Please tell me which of these numerous ideas you consider adequate, or provide me with some additional ideas.
On the whole, I suggest concentrating on three major topics:
1. Theoretical Computer Science
- demonstrate finite-state technology by means of a transducer that encodes some fancy morphological rules, preferably something German such as subjunctive inflection or plural forms for certain noun classes; perhaps use some graphical tool to project the FST onto a wall and let it process random strings?
- explain the canonical "S --> NP VP" style toy CFG and discuss how this describes a language (introduce notions such as syntactic structure, derivation, ambiguity etc.)
- take this toy CFG to introduce CYK parsing and let people fool around a bit with it
- explain why it is not wise to simply try out all alternatives until the solution is found; this could be a good way of introducing complexity classes
- mention some undecidable problems and point out intuitively why they are undecidable
- create some confusion and mystery about NP-completeness and the P=NP problem
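For the CYK idea above, a minimal sketch of what visitors could play with might look like the following. The grammar and lexicon are hypothetical toy data made up for illustration (they are not taken from any existing project of ours), and the function only recognizes sentences; it does not build trees.

```python
from itertools import product

# Hypothetical toy grammar in Chomsky normal form (assumption, for illustration only).
BINARY_RULES = {
    ("NP", "VP"): {"S"},
    ("Det", "N"): {"NP"},
    ("V", "NP"): {"VP"},
}
LEXICON = {
    "the": {"Det"},
    "dog": {"N"},
    "cat": {"N"},
    "sees": {"V"},
}


def cyk_recognize(words, start="S"):
    """Return True if the toy grammar derives the word list from `start`."""
    n = len(words)
    if n == 0:
        return False
    # chart[i][j] holds all categories that span words[i..j] (inclusive).
    chart = [[set() for _ in range(n)] for _ in range(n)]
    for i, word in enumerate(words):
        chart[i][i] = set(LEXICON.get(word, ()))
    # Fill the chart for longer and longer spans.
    for span in range(2, n + 1):
        for i in range(n - span + 1):
            j = i + span - 1
            # Try every split point between i and j.
            for k in range(i, j):
                for left, right in product(chart[i][k], chart[k + 1][j]):
                    chart[i][j] |= BINARY_RULES.get((left, right), set())
    return start in chart[0][n - 1]
```

Calling `cyk_recognize("the dog sees the cat".split())` accepts the sentence, while a scrambled word order such as `"dog the sees"` is rejected. The three nested loops also make a nice hook for the complexity discussion.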
2. Logic
- introduce the basic set-theoretic notions and state some common-sense theorems
- informally introduce basic predicate logic (boolean connectives, quantifiers etc.)
- demonstrate how useful FOL is for expressing facts about objects and their relations ("model theory")
- introduce the canonical scope ambiguity example (ExAy vs AxEy) to motivate its use in formal semantics
- maybe show the Peano axiomatization for natural numbers (not really CL-related, but nice to discuss notions like axioms, models etc.)
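The scope ambiguity example (ExAy vs AxEy) could be made tangible by checking both readings against a tiny model. The sketch below assumes a made-up two-person domain and a made-up "loves" relation; the names and the relation are purely illustrative.

```python
def forall_exists(domain, relation):
    """AxEy reading: everybody is related to at least one individual."""
    return all(any((x, y) in relation for y in domain) for x in domain)


def exists_forall(domain, relation):
    """ExAy reading: some single individual is related to everybody."""
    return any(all((x, y) in relation for y in domain) for x in domain)


# Hypothetical model: Anna loves Ben and Ben loves Anna, nobody loves themselves.
PEOPLE = {"anna", "ben"}
LOVES = {("anna", "ben"), ("ben", "anna")}

READINGS = {
    "AxEy": forall_exists(PEOPLE, LOVES),
    "ExAy": exists_forall(PEOPLE, LOVES),
}
```

In this model the AxEy reading holds and the ExAy reading does not, which shows that the two formulas really describe different situations.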
3. Discrete Mathematics
- introduce graphs and especially trees, explaining how to formalize them
- introduce the concepts of recursion and induction by proving some trivial property of trees
- combinatorics, e.g. "How many ways are there to bracket an expression?"
- some illustrative example for combinatorial explosion, perhaps some hints on how to avoid that
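The bracketing question can double as the combinatorial explosion example. A small sketch, assuming we count complete binary bracketings of a sequence of n items (the function name is our own invention):

```python
from functools import lru_cache


@lru_cache(maxsize=None)
def count_bracketings(n_items):
    """Number of ways to fully bracket a sequence of n_items into binary groups."""
    if n_items <= 1:
        return 1
    # Choose where the outermost split falls, then bracket both halves independently.
    return sum(
        count_bracketings(k) * count_bracketings(n_items - k)
        for k in range(1, n_items)
    )
```

For 3, 4, 5 and 10 items this yields 2, 5, 14 and 4862 bracketings (the Catalan numbers). The `lru_cache` decorator stores results that have already been computed, which is one possible hint on how to tame the explosion in the computation itself.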
How do search engines work? What's a (linguistic) Corpus? Ideas:
- Present an annotated corpus with a cool interface (latest SPLICR alpha version maybe)
- Automatic text mining (possible demo application: WERTi)
Volunteers: Aleks, Anonymous
This section will give visitors a short introduction to computer science as practised in CL. It will contain an introduction to problem solving using systematic methods (probably algorithms, though people have voted to put that into the mathematics/logic section), including (but not limited to):
- Object-oriented programming
- Presentation of typical homeworks or projects (passivator)
Volunteers: Anas, Anonymous
This idea came from Anas: it makes it possible to explain algorithms without actually showing or using any "scary" code.
- Sorting and search algorithms could actually be used for an activity game. Let two teams of people try to sort a chaotic array of objects with as few steps as possible. People can choose to adhere to one of the standard algorithms or to use human intuition. Starting from the results, one could then introduce notions such as amortized analysis, divide-and-conquer, worst-case behaviour and average-case behaviour.
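For whoever moderates the sorting game, a small helper for counting steps could be useful in the background. The sketch below is only a suggestion: it counts comparisons for selection sort and merge sort, and the example cards are made up.

```python
def selection_sort(items):
    """Sort a copy of items; return (sorted_list, number_of_comparisons)."""
    items = list(items)
    comparisons = 0
    for i in range(len(items)):
        smallest = i
        for j in range(i + 1, len(items)):
            comparisons += 1
            if items[j] < items[smallest]:
                smallest = j
        items[i], items[smallest] = items[smallest], items[i]
    return items, comparisons


def merge_sort(items):
    """Divide-and-conquer sort; return (sorted_list, number_of_comparisons)."""
    items = list(items)
    if len(items) <= 1:
        return items, 0
    mid = len(items) // 2
    left, left_count = merge_sort(items[:mid])
    right, right_count = merge_sort(items[mid:])
    merged = []
    comparisons = left_count + right_count
    i = j = 0
    # Merge the two sorted halves, counting one comparison per step.
    while i < len(left) and j < len(right):
        comparisons += 1
        if left[i] <= right[j]:
            merged.append(left[i])
            i += 1
        else:
            merged.append(right[j])
            j += 1
    merged.extend(left[i:])
    merged.extend(right[j:])
    return merged, comparisons


# Hypothetical set of eight numbered cards for the two teams.
CARDS = [7, 2, 9, 4, 1, 8, 3, 6]
```

Selection sort always needs n(n-1)/2 comparisons (28 for eight cards), whereas merge sort needs at most 17 for eight cards, so the numbers give a concrete starting point for talking about worst-case behaviour and divide-and-conquer.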
Food and Drinks
During the whole program, or at least a large subset of it, food and drinks will be served in the hall. The faculty will pay for this as well.
- Place: Somewhere in the hall. I think the first floor makes sense. Edit from Laura: I was thinking of having the welcome desk in the hall on the first floor and the food and drink stuff in room 1.10, where we will also put all the comfy seats and some tables to create a lounge-y feeling so that people will come there and actually have a look at the software projects.
- Suggested food and drinks:
- Pretzels - 60
- Bread rolls (wholegrain) - 20!
- Apples - 20
- Bananas - 20
- Apple juice
- something fanta- or cola-ish
- tea (black, green, herbal, fruit)
- coffee if we can get it
Fachschaft Workspace
Ideas / Sources
It seems that there is a very nice introduction to CL on the pages of CL in Stuttgart. Anyone willing to share a link? Also, Hubert Truckenbrodt's lecture notes for the introduction to phonology and Ede Zimmerman's lecture notes for the introduction to semantics are very easy to understand and contain a lot of good examples.
Open tasks or TODO
This is the section for all the small things that we can or have to do. Volunteers for these tasks should contact Desi as soon as possible for more information.
- Print out the program of the event together with helpful information - Desi
- Make labels with the names of all of us (SFS): - Desi
- Laura Kassner
- Niels Ott
- Aleksandar Dimitrov
- Aleksandar Savkov
- Anas Elghafari
- Nomi Meixner
- Dominikus Wetzel
- Anne Brock
- Kilian Evang
- Johannes Dellert
- Maria Tchalakova
- Katya Volkova
- Tatiana Vodolazova
- Magdalena Leshtanska
- Emma Li
- Evgenia Ivanova
- Iliana Simova
- Maria Schmidt
- Ramon Ziai
- Desislava Zhekova
- Organize food and drinks - Desi
- Flyers and info materials to take away - Kilian. They should contain information such as:
- Application deadline for the ISCL program (July 15)
- Necessary documents for the application
- Some FAQs from the SfS webpage
- Information about CL in general (should be more than on the poster)
- Contact data (email, webpage, ...)
- Some "Werbegeschenke" (promotional giveaways) would be nice to have as well
- Orientation sheets (maps and posters showing the way to the different rooms)
- Send around posters by mail - done by Maria (2nd semester)
- Put up posters around Tübingen - done by Iliana
- Talk to the Tübingen press (television, newspapers, radio: find a contact and ask whether they could cover us as news) - done by Laura
- Guest Book - dropped
Please put your name (or, if you prefer an anonymous vote, just a pipe (|)) next to the badge layout you like most.
1. UNI LOGO:
2. ISCL UNI:
3. UNI ISCL: Laura, Johannes, Anas, Maria
4. ISCL: Desi, Kilian