Difference between revisions of "Tag der Computerlinguistik"

From FachschaftSprachwissenschaft

Revision as of 11:51, 24 April 2008

The Day of Computational Linguistics will be held sometime in May this year and will serve to attract potential students from nearby high schools and colleges to our course program. After an introduction to computational linguistics in general and to Tübingen's ISCL in particular, attendees will be free to gather information from several different sections, each devoted to one particular facet of CL. The event is currently being organized by the Fachschaft members; if you would like to join the preparations, you are very welcome to do so.

Date, Place and Time

The Open Door Day might be on 3 May in the SfS.

We should also settle on a proper start time and duration for the event. How about starting at 10 a.m. and running for 4-5 hours at most? (Of course, we can always continue on demand, but it is good to have a plan.)


Here are both a PNG version of the poster and the original SVG made with Inkscape. Please use the SVG if you are going to make any changes to the poster. Please use the PNG if you only want to look at it.

Drawing.png Drawing.svg

Here is the Java source that was used to generate the background "noise". The text used for it was originally taken from gutenberg.org and is an excerpt from Romeo and Juliet.



Prof. Hinrichs will give the visitors an official introduction, presenting Tübingen and the ISCL course of study.

  • Place: Not known yet

Information Bazaar

Everyone can visit a total of five different sections, each devoted to a particular topic:

Industry talks

We will invite people from EML as well as IBM and probably other companies (Daimler Chrysler?) to give talks. The absolute maximum number of talks is three.

  • Place: Not known yet


No content yet.

  • Place: Not known yet


No content yet.

  • Place: Not known yet


We want to invite our teachers to give a sort of introductory lecture. Whom should we invite? Ideas so far are:

  • Sam Featherston - Place: Not known yet
  • Frank Richter - Place: Not known yet

Food and Drinks

During the whole program, or at least a large part of it, food and drinks will be served in the hall. The faculty will pay for this as well.

  • Place: Somewhere in the hall. I think the first floor makes sense.


Each section will give a short intro and is to be staffed by two of us. Please volunteer.


Presentation of intriguing examples, most likely from German, since most attendees are going to be German. Ideas include:

  • Collection of marked sentences in Sternefeld 2006 - initiate discussion about their grammaticality
  • Presenting ambiguities in languages
  • Show how different languages can be (there is an excellent example of Chinese weirdness here).

On the basis of the examples, we can try to justify bracketing patterns and tree structures and present them.


Volunteers

  • Anonymous
  • Anonymous


This is the station to win over the mathematically minded to our program. At this station, people will be able to play around with

  • Finite State Automata (using the nice graphical tools we have)
  • other graph structures (trees etc.)
  • some easy proofs in set theory (not formal, all based on common sense)

And I also intend to offer some info on

  • complexity classes (hard versus easy problems, feasibility of computation etc.)
  • How To Encode Infinity (cyclic structures, non-termination etc.)
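
For the finite state automata part of this station, a small hand-written example could complement the graphical tools. The following is only a sketch under our own assumptions: the automaton (it accepts strings over a and b that contain an even number of a's), the state names and the function name `accepts` are made up for illustration and are not taken from any of the tools mentioned above.

```python
# Hypothetical example automaton: accepts strings over {a, b}
# that contain an even number of 'a's.
TRANSITIONS = {
    ("even", "a"): "odd",
    ("even", "b"): "even",
    ("odd", "a"): "even",
    ("odd", "b"): "odd",
}
START = "even"
ACCEPTING = {"even"}


def accepts(word):
    """Run the automaton on `word` and report whether it ends in an accepting state."""
    state = START
    for symbol in word:
        key = (state, symbol)
        if key not in TRANSITIONS:
            # Symbol outside the alphabet: reject immediately.
            return False
        state = TRANSITIONS[key]
    return state in ACCEPTING
```

Visitors could be asked to predict the result for a few words (for example "abab" or "ab") before running `accepts` on them, and then to change the transition table themselves.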


Text Mining

How do search engines work? What's a (linguistic) Corpus? Ideas:

  • Present an annotated corpus with a cool interface (latest SPLICR alpha version maybe)
  • Automatic text mining (possible demo application: WERTi)
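
To give a first answer to the search engine question, a toy inverted index might be enough for a demo. This is a minimal sketch and not how SPLICR, WERTi or any real search engine works: the function names `build_index` and `search`, the whitespace tokenization and the three sample documents are all assumptions made for illustration.

```python
from collections import defaultdict


def build_index(documents):
    """Map every lowercased token to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for token in text.lower().split():
            index[token].add(doc_id)
    return index


def search(index, query):
    """Return the ids of documents that contain every token of the query."""
    tokens = query.lower().split()
    if not tokens:
        return set()
    # Start with the documents for the first token, then intersect with the rest.
    result = set(index.get(tokens[0], set()))
    for token in tokens[1:]:
        result &= index.get(token, set())
    return result


# Made-up sample documents, used for illustration only.
DOCS = {
    1: "Romeo loves Juliet",
    2: "Juliet is the sun",
    3: "the sun rises",
}
INDEX = build_index(DOCS)
```

Calling `search(INDEX, "the sun")` looks up each query word in the index and intersects the resulting sets of document ids, which is the basic idea visitors should take away.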

Volunteers

  • Kilian



This section will give the visitors a short introduction to Computer Science as practised in CL. It will contain an introduction to problem solving using systematic methods (probably algorithms, though people have voted to put that into the mathematics/logic section) including (but not limited to)

  • Object-oriented programming
  • Presentation of typical homework assignments or projects (passivator)



This idea came from Anas: it would make it possible to explain algorithms without actually showing or using any "scary" code.


Volunteers

  • Anas
  • Anonymous

Ideas / Sources

It seems that there is a very nice introduction to CL on the pages of CL in Stuttgart. Anyone willing to share a link? Also, Hubert Truckenbrodt's lecture notes for the introduction to Phonology and Ede Zimmermann's lecture notes for the introduction to Semantics are very easy to understand and contain a lot of good examples.


This is the section for all the small things that we can or have to do.

  • Guest book
  • Flyers and info materials to take away
  • Some "Werbegeschenke" (promotional giveaways) would also be quite nice to have
  • Orientation sheets (maps and posters showing the way to the different rooms)