Recent Progress in Data Integration
Daniela Florescu (INRIA Rocquencourt)
Alon Levy (University of Washington)

Managing Multimedia Information in a Database Environment
William I. Grosky (Wayne State University)

Invited Speakers:

Rules...and what's next?
Towards Second Generation Data Mining Systems
Tomasz Imieli¸ski (Rutgers University)

Workflow Management in the Internet Age
C. Mohan (IBM Almaden Research Center, INRIA Rocquencourt)
Monday, September 7
08:30 - 10:00 Tutorial 1
Recent Progress in Data Integration
Daniela Florescu (INRIA Rocquencourt),
Alon Levy (University of Washington)
10:00 - 10:30 Coffee Break
10:30 - 12:00 Continuation of Tutorial 1
12:00 - 12:30 Coffee Break
12:30 - 14:00 Continuation of Tutorial 1
14:00-15:30 Lunch Break
15:30-17:00 Tutorial 2
Managing Multimedia Information in a Database Environment
William I. Grosky (Wayne State University)
17:00-17:30 Coffee Break
17:30-19:00 Continuation of Tutorial 2

Tuesday, September 8
09:00 - 09:30 Opening Session
09:30 - 10:30 Keynote Talk
Rules...and what's next?
Towards Second Generation Data Mining Systems
Tomasz Imielinski (Rutgers University)
10:30 - 11:00 Coffee Break
11:00 - 12:30 Session 1: Query Languages
Untyped Queries, Untyped Reflective Machines and Conditional Quantifiers
Jose Maria Turull Torres
Containment of Conjunctive Queries with Built-in Predicates with Variables and Constants over any Ordered Domain
Nieves R. Brisaboa, Hector J. Hernandez, Jose R. Parama, Miguel R. Penabad
Using Queries with Multi-Directional Functions for Numerical Database Applications
Staffan Flodin, Kjell Orsborn, Tore Risch
12:30-14:30 Lunch Break
14:30-16:00 Session 2: Optimization
Multiple Range Query Optimization in Spatial Databases
Apostolos N. Papadopoulos, Yannis Manolopoulos
Optimizing Command Logs by Exploiting Semantic Knowledge
Roland Baumann
A Distributed Algorithm for Global Query Optimization in Multidatabase Systems
Silvio Salza, Giovanni Barone, Tadeusz Morzy
16:00-16:30 Coffee Break
16:30-18:00 Session 3: Collaborative Systems
Transaction Management in Databases Supporting Collaborative Applications
Waldemar Wieczerzycki
Object-Oriented Design of a Flexible Workflow Management System
Mathias Weske
Extending Transaction Closures by N-ary Termination Dependencies
Kerstin Schwarz, Can Tuerker, Gunter Saake
19:30 Reception
Wednesday, September 9
09:00-10:00 Invited Talk
Workflow Management in the Internet Age
C. Mohan (IBM Almaden Research Center)
10:00-10:30 Coffee Break
10:30-12:30 Session 4: East Meets West
Distributed Information Systems
Ralf Kramer, Peter C. Lockemann
Integrated Web - Database Applications for Electronic Business
Wojciech Cellary
Term Weighting in Query-Based Document Clustering
Kai Korpimies, Esko Ukkonen
Discovery of Object-Oriented Schema and Schema Conflicts
Hele-Mai Haav, Mihhail Matskin
On the Ordering of Rewrite Rules
Joachim Kroeger, Stefan Paul, Andreas Heuer
Towards Data and Object Modelling
Jaroslav Pokorny
12:30-14:30 Lunch Break
14:30-16:00 Session 5: Schema Integration
Propagation of Structural Modifications to an Integrated Schema
Regina Motz
Integration of Schemas Containing Data Versions and Time Components
Bogdan D. Czejdo, Maciej Matysiak, Tadeusz Morzy
Deriving Relationships between Integrity Constraints for Schema Comparison
Can Tuerker, Gunter Saake
16:00-16:30 Coffee Break
16:30-18:00 Session 6: Storage and Version Management
A Database Interface Integrating a Querying Language for Versions
Eric Andonoff, Gilles Hubert, Annig Le Parc
The nP-Tree: Region Partitioning and Indexing for Efficient Path Planning
Lusiana Nawawi, Janusz R. Getta, Phillip J. McKerrow
Replication in Mirrored Disk Systems
Athena Vakali, Yannis Manolopoulos
19:30 Choir Concert
Thursday, September 10
09:00-10:30 Session 7: Object Systems
Clustering Techniques for Minimizing Object Access Time
Vlad S. Wietrzyk, Mehmet A. Orgun
Designing Persistence for Real-Time Distributed Object Systems
Igor Nekrestyanov, Boris Novikov, Ekaterina Pavlova
Partial Replication of Object-Oriented Databases
Michael Dobrovnik, Johann Eder
10:30-11:00 Coffee Break
11:00-13:00 Session 8: Knowledge Discovery and the Web
Optimizing Knowledge Discovery over the WWW
Matthew Montebello
Data Mining Query Language for Object-0riented Database
Vladimir Novacek
Itemset Materializing for Fast Mining of Association Rules
Marek Wojciechowski, Maciej Zakrzewicz
Schema Derivation for WWW Information Sources and their Integration with Databases in Bioinformatics
Michael Hoeding, Ralf Hofestaedt, Gunter Saake, Uwe Scholz
13:00-15:00 Lunch Break
15:00-16:30 Session 9: System Design
Component-based Information Systems Development Tool Supporting the SYNTHESIS Design Method
Dmitry O. Briukhov, Leonid A. Kalinichenko
Translating Relational Queries to Object-Oriented Queries According to ODMG-93
Ahmed Mostefaoui, Jacques Kouloumdjian
A Flexible Framework for a Correct Database Design
Donatella Castelli, Serena Pisani
16:30-17:00 Coffee Break
17:00-18:30 Session 9: Industrial Track
Human Resources Information Systems Improvement: Involving Financial Systems and Other Sources Data
Sergey Zykov
Physical Structures Design For Relational Databases
Janusz Charczuk
Modeling of Census Data in a Multidimensional Environment
Holger Guenzel, Wolfgang Lehner, Stein Eriksen, Jon Folkedal


Daniela Florescu is a researcher in the Rodin group in INRIA Rocquencourt. She received her B.Sc. in Computer Science and Mathematics from the University of Bucharest in 1991, Masters of Computer Science from University of Paris VI in 1992, and a Ph.D. in Computer Science from INRIA and the University of Paris VI in 1996. From November 1996 till December 1997 she was a senior member of the technical staff at AT&T Research Laboratories. Her current research interests are information integration, query optimization in object­oriented database systems, query execution models for parallel databases, query reformulation in multidatabase systems, semistructured data and web­site management systems.

Alon Levy is an assistant professor in the University of Washington in Seattle. He received his B.Sc. in Computer Science and Mathematics from the Hebrew University in Jerusalem in 1988, and his Ph.D. in Computer Science from Stanford University in 1993. From 1993 to 1997 he was a principal member of the technical staff at AT&T Research Laboratories. His current research interests are information integration, semistructured data, materialized views, web­site management systems, knowledge representation and connections between database systems and Artificial Intelligence.

William I. Grosky is currently professor and chair of the Computer Science Department at Wayne State University in Detroit, Michigan. His current research interests are in multimedia information systems, hypermedia, image databases, and web technology. Dr. Grosky received in B.S. in mathematics from MIT in 1965, his M.S. in Applied Mathematics from Brown University in 1968, and his Ph.D. in Engineering and Applied Sciences from Yale University in 1971. Serving on many database and multimedia conference program committees, he is currently the Editor-in-Chief of IEEE Multimedia magazine and on the editorial boards of the Journal of Database Management and Pattern Recognition.

Tomasz Imieli¸ski is currently Professor and Chairman of the Department of Computer Science, Rutgers University in New Brunswick NJ, USA. He has received his Ph.D. from Polish Academy of Science (Warsaw) in 1982. His initial work dealt with the issues of representation and querying of databases with incomplete information. His current interests include database mining and mobile wireless computing. Dr. Imielinski is currently Director of Mobile Computing Laboratory (DataMan) at Rutgers University which is sponsored by DARPA, NSF and a number of companies. Since coauthoring the original paper introducing association rules Dr. Imielinski has been leading the development of "Discovery Board" the experimental data mining system developed at Rutgers University. He has been active in numerous program committees of conferences such as SIGMOD, VLDB, KDD and Mobicomm and is currently an Associate Editor of ACM/Baltzer Nomad - journal of Wireless and Mobile Communications and Computing, and Knowledge Discovery Journal (Kluwer). He has edited three books, including "Mobile Computing", (T. Imielinski, H.Korth, Kluwer 1996). Dr. Imielinski is a cofounder and chief technology officer of Hevelius Software - a company which develops data mining applications.

Dr. C. Mohan, after graduating from the Indian Institute of Technology at Madras in 1977 and the University of Texas at Austin in 1981, joined the IBM Almaden Research Center. In June 1997, he was named an IBM Fellow for being recognized worldwide as a leading innovator in database transaction management. He received the 1996 ACM SIGMOD Innovations Award in recognition of his innovative contributions to the development and use of database systems. Since late 1996, he has been leading the Dominotes project whose goal is to enhance Lotus Domino/Notes by introducing transactional recovery. Earlier, he led the Exotica workflow management project which was focused on IBM's FlowMark, MQSeries and Lotus Notes. During 6/98-6/99, he is on a sabbatical at INRIA, Rocquencourt (France). Mohan has received numerous IBM awards: 1 Corporate Award, 7 Outstanding Innovation Awards, 2 Research Division Awards and the 9th Plateau Invention Achievement Award for patent activities (28 issued, 4 pending). He was a designer and an implementor of the R* distributed DBMS, the Starburst extensible DBMS and DB2. His algorithms have been implemented in several IBM and non-IBM products, and university prototypes. He is the primary inventor of the ARIES family of recovery and locking methods, and the industry-standard Presumed Abort commit protocol. His research interests include concurrency control, recovery, commit protocols, index management, semi-structured data management, query optimization, active databases, OODBMSs, workflow, and distributed systems. He was the Americas Program Chair for the 1996 International Conference on Very Large Data Bases, the Program Chair of the 1987 International Workshop on High Performance Transaction Systems, and a Program Vice-Chair of the 1994 International Conference on Data Engineering. He is an editor of the VLDB Journal, and Distributed and Parallel Databases - An International Journal.

Recent progress in data integration

In the last few years there has been considerable interest in the problem of providing access to large collections of distributed
heterogeneous information sources (e.g., sources on the World-Wide Web, company-wide databases). This interest has spawned a significant amount of research in Database Systems and related fields (e.g., Artificial Intelligence, Operating Systems, Human Computer Interaction). This has led to the development of several research prototypes for information integration and recently, we are seeing the beginnings of an industry addressing this problem.

The goal of this tutorial is to survey the work on information integration, to illustrate the common principles underlying this body
of work, to assess the state of the art, and identify the open research problems in this area. The tutorial will illustrate the issues
involved in information integration through several implemented systems.

Managing Multimedia Information in a Database Environment

We discuss multimedia information management from the point of view of database systems: how the various aspects of database design and the modules of a database system have evolved over the years to better manage multimedia information. We start by examining the nature of multimedia data and the area of multimedia data modeling. We then discuss how multimedia data has influenced the evolution of the various modules of a standard database system; specifically, those having to do with query processing, choice of access methods, query optimization, transaction management, buffer management, storage management, recovery, and security. Finally, we consider the various commercial systems that have recently appeared which manage multimedia information.

Supported by Synthesis Group