Parallel Data Mining Workshop 2011

Held in conjunction with 2011 SIAM International Conference on Data Mining (SDM 2011).

Hilton Phoenix East/Mesa, Mesa, Arizona, USA
April 30, 2011

Hilton Phoenix East/Mesa, Mesa, Arizona, USA

April 30, 2011

Program

13:30-14:30 Invited talk by Pr. Gagan Agrawal
14:30-15:00 Aerial Root Classifers for Predicting Missing Values in Data Stream Decision Tree Classifcation, Yang Hang, Simon Fong, Wei Chen
15:00-15:30 Coffee break
15:30-16:00 Monitoring Multiple Streams with Dynamic Time Warping using Graphic Processors, Jason Chang, Mi-Yen Yeh
16:00-16:30 Supporting Dynamic Load Balancing in a Parallel Data Mining Middleware, Tekin Bicer, Gagan Agrawal

Workshop notes

The workshop notes are available in PDF

Description

Today is an exciting time to be a data mining researcher. There is a huge quantity of available data, and a high demand for knowledge extraction from this data that only data mining can provide. Moreover, with the democratization of parallel computing solutions, be it multicore processors, GPUs, clusters or grids, there has never been more raw computing power available.

There is a catch however: parallel computing power does not come for free. Designing efficient parallel algorithms requires new skills, many of which are yet to discover. Most current programming language does not make parallel programming easy, leading to new research on different ways of programming. Then performance analysis, which is a crucial point for data mining algorithms, get much more complex, as the scaling with the number of computing elements must also be studied. Bad performance can be caused by an inefficient usage of the underlying parallel system architecture. These problems quickly arise as data mining algorithms often need to perform complex computations on huge volumes of data, with irregular and unpredictable computing loads, and generating large quantities of intermediate data and/or results. This pushes the parallel architectures to their limits, forcing data mining researchers to get a precise understanding of how these architectures work and to rethink existing algorithms.

The PDM workshop provides a venue for all data mining researchers working on parallel data mining, whatever the parallel support used. Despite differences in support, the main problems of parallelism remain the same, discussions between researchers working on different parallel supports or different domains of data mining will allow cross-fertilization and help everyone to progress.

Submissions of early research and innovative ideas are highly encouraged.

Researchers applying parallel data mining to real-world data and applications are also most welcome.

Topics of interest

New parallel data mining algorithms, especially in pattern mining, classification, clustering
Multicore CPU data mining
GPU data mining
Cluster-based or grid-based distributed data mining
Cloud data mining
Peer-to-peer data mining
Performance analysis of parallel data mining algorithms
Theoretical foundations of parallel data mining
Cache-aware / Cache-oblivious data mining algorithms
Parallel data-mining over streams
Parallel implementations of data mining algorithms with programming languages well adapted to parallelism, for example functional languages
Applications using parallel data mining in either an industrial or a scientific context

Submission

Please upload your paper in PDF format at: SUBMIT HERE

The paper should have at most 10 pages and use the SIAM style available at: http://www.siam.org/tex/books/soda2e.all.

More details on SIAM paper formatting can be found here. The paper must clearly present author information including name, affiliation and email.

Each paper will receive at least two reviews to determine acceptance.

Important dates


Papers due:	January 14, 2011 Extended deadline : January 21, 2011
Notifications:	~~February 4, 2011~~
Camera ready:	~~February 11, 2011~~
Workshop:	~~April 30, 2011 (half-day)~~

Registration

Attendees are required to register for SDM 2010 so that no separate registration is needed for this workshop. A one-day registration for the conference is available. Workshop attendees do not have to register at the complete conference rate. Click here for more details.

Program Committee

Chair: Alexandre Termier (Grenoble University, France)

Srinivasan Parthasarathy (Ohio State University, USA)
Georges Karypis (University of Minnesota, USA)
Shirish Tatikonda (IBM Almaden, USA)
Geoffrey C. Fox (Indiana University, USA)
Anne Laurent (University of Montpellier 2, France)
Jean-Francois Méhaut (Grenoble University, France)
Benjamin Négrevergne (Grenoble University, France)
Claudio Lucchese (ISTI-CNR Pisa, Italy)
Nicolas Hanusse (Bordeaux University, France)
Sadok Ben Yahia (Tunis-El Manar University, Tunisia)
Mario Rosario Guarracino (ICAR-CNR, Italy)
Domenico Talia (University of Calabria, Italy)
Maurice Tchuente (University of Yaoundé 1, Cameroon)

Organizing Committee

Alexandre Termier
first name [dot] last name [at] imag.fr
LIG (Laboratoire d'Informatique de Grenoble)
Université Joseph Fourier
681 rue de la Passerelle
B.P. 72, 38402 Saint Martin d'Hères
FRANCE
Phone: +33 4 76 82 72 07
Fax: +33 4 76 82 72 87
http://membres-liglab.imag.fr/termier/

Last modified:

Date: 2011-05-11 16:43:53 CEST

Org version 7.5 with Emacs version 23

Validate XHTML 1.0

Parallel Data Mining Workshop 2011

Table of Contents

Hilton Phoenix East/Mesa, Mesa, Arizona, USA

April 30, 2011

Program

Workshop notes

Description

Topics of interest

Submission

Important dates

Registration

Program Committee

Organizing Committee

Last modified: