Navigation

Open Source Projects

Durm German Lemmatizer
IntelliGenWiki
Javadoc NLP Corpus Generator
LODeXporter
LODtagger
Predicate-Argument Extractor
Multi-lingual NP Chunker
Open Mutation Miner
OpenTrace
OrganismTagger
OwlExporter
Reported Speech Tagger
ReqWiki
Rhetector

The Fuzzy Book

You can read my book Architektur von Fuzzy-Informationssystemen (Architecture of fuzzy information systems) in both a dead-tree and a (free!) online version.

Take a quote

Insanity: doing the same thing over and over again and expecting different results.

— Albert Einstein

Search

Results 81 - 90 of 91

Results

Fuzzy Clustering for Topic Analysis and Summarization of Document Collections

Montreal 2007

Abstract

Large document collections, such as those delivered by Internet search engines, are difficult and time-consuming for users to read and analyse. The detection of common and distinctive topics within a document set, together with the generation of multi-document summaries, can greatly ease the burden of information management. We show how this can be achieved with a clustering algorithm based on fuzzy set theory, which (i) is easy to implement and integrate into a personal information system, (ii) generates a highly flexible data structure for topic analysis and summarization, and (iii) also delivers excellent performance.

Ontological Text Mining of Software Documents

Paris, France

Abstract

Documents written in natural languages constitute a major part of the software engineering lifecycle artifacts. Especially during software maintenance or reverse engineering, semantic information conveyed in these documents can provide important knowledge for the software engineer. In this paper, we present a text mining system capable of populating a software ontology with information detected in documents.

Task-Dependent Visualization of Coreference Resolution Results

A single coreference chains visualized as a Topic Map

Abstract

Graphical visualizations of coreference chains support a system developer in analyzing the behavior of a resolution algorithm. In this paper, we state explicit use cases for coreference chain visualizations and show how they can be resolved by transforming chains into other, standardized data formats, namely Topic Maps and Ontologies.

Next-Generation Summarization: Contrastive, Focused, and Update Summaries

Conference Hotel, Borovets, Bulgaria

Abstract

Classical multi-document summaries focus on the common topics of a document set and omit distinctive themes particular to a single document—thereby often suppressing precisely that kind of information a user might need for a specific task. This can be avoided through advanced multi-document summaries that take a user's context and history into account, by delivering focused, contrastive, or update summaries. To facilitate the generation of these different summaries, we propose to generate all types from a single data structure, topic clusters, which provide for an abstract representation of a set of documents. Evaluations carried out on five years' worth of data from the DUC summarization competition prove the feasibility of this approach.

Durm German Lemmatizer v1.0 Released

Submitted by rene on Thu, 2007-05-31 08:59.

I'm happy to announce the first public release of our free/open source Durm Lemmatization System for the German language.

The release comes with source code, binaries, documentation, resources (German lexicon, Case Tagger probabilities), and manually annotated texts from the German Wikipedia for evaluation.

New Job, New Website

Submitted by rene on Sat, 2008-05-31 19:00.

Personal

As of June 1st, 2008, I'm now working as an assistant professor in the Department of Computer Science and Software Engineering at Concordia University in Montréal, Canada. Coinciding with the new position, I'm also building a new website, www.semanticsoftware.info. There are two main ideas behind this website: First, to inform about the research and teaching activities of my Semantic Software Lab, which I'm establishing at Concordia; and second, to establish a community portal for selected topics in the area of semantic systems — for example, for people interested in the applications of NLP in software engineering.

Deadline extended for STSM

Submitted by rene on Wed, 2008-04-16 22:36.

We extended the paper submission deadline for our workshop on Semantic Technologies in System Maintenance (STSM) to April 25th.

Call for Papers: International Workshop on Semantic Technologies in System Maintenance (STSM 2008)

Submitted by rene on Tue, 2008-02-12 21:19.

Together with Jürgen Rilling, Dragan Gaševi?, and Jeff Z. Pan I'm organizing the first International Workshop on Semantic Technologies in System Maintenance (STSM 2008), which will be co-located with the 16th IEEE International Conference on Program Comprehension (ICPC 2008) in Amsterdam, The Netherlands.

Detailed information on the workshop, submission guidelines, and other news are now available from the workshop's webpage.

Workshop on Semantic Technologies in System Maintenance at ICPC 2008

Submitted by rene on Wed, 2008-01-23 21:33.

It's official: I'm co-organizing the (first) International Workshop on Semantic Technologies in System Maintenance (STSM) at the next IEEE International Conference on Program Comprehension (ICPC 2008) in Amsterdam, The Netherlands. Some preliminary information are available on the ICPC website. A call for papers and more details are coming soon!

Workshop on Traceability at CASCON 2007

Submitted by rene on Tue, 2007-10-02 09:06.

Traceability

Together with Juergen Rilling from Concordia University and Philippe Charland from the DRDC Canada I'm organizing a workshop at CASCON 2007: Traceability in Software Engineering—Past, Present and Future. It's on October 25 at the Sheraton Parkway Toronto North Hotel and Convention Centre, Ontario, Canada.

rene-witte.net

Navigation

Open Source Projects

The Fuzzy Book

Take a quote

Search

Results

Fuzzy Clustering for Topic Analysis and Summarization of Document Collections

Abstract

Ontological Text Mining of Software Documents

Abstract

Task-Dependent Visualization of Coreference Resolution Results

Abstract

Next-Generation Summarization: Contrastive, Focused, and Update Summaries

Abstract

Durm German Lemmatizer v1.0 Released

New Job, New Website

Deadline extended for STSM

Call for Papers: International Workshop on Semantic Technologies in System Maintenance (STSM 2008)

Workshop on Semantic Technologies in System Maintenance at ICPC 2008

Workshop on Traceability at CASCON 2007

See also

Poll

Popular content

Today's:

Last viewed: