How can experiments be more systematic and comparable?

Participants: Arno, Chuang, Guoli, Matteo, Michael

What is the State of the Art?

What Are We Striving For?

What can WE do?

Where can we obtain realistic workloads and data sets?

What Data Do We Need?

What benchmarks ''do'' exist or ''should'' exist?

How have other communities developed and adopted benchmarks?

What are realistic models for workload generation?

What are good performance metrics?

I would like to draw the attention of the DEBS community to our paper titled "Constructing scalable overlay for pub-sub with many topics", which is published in PODC'07. The paper is available from http://www.ifi.uio.no/~romanvi/Papers/scalable-overlay-theory.ps This work is decidedly not about a new pub-sub system; it rather attempts to formally capture and theoretically analyze a fundamental problem of building and evaluating pub-sub overlays. Since many existing pub-sub systems have been tackling this problem from the practical standpoint, perhaps this paper can be considered a (rather small) step towards creating the unifying theory of pub-sub. Specifically, we believe that our work provides the following potential benefits for the DEBS community: 1. It includes and can be further extended toward evaluation criteria for pub-sub overlays. This may be relevant for the effort of creating commonly used pub-sub benchmarks. 2. It determines theoretical limits of what a practical pub-sub system designer should strive and can hope to achieve. In particular, it includes a nearly optimal centralized algorithm for building an overlay, which can be used as a baseline for distributed implementations in practice. The current paper version only targets topic-based pub-sub. Since this is a conference version limited in length, the list of references is very far from being comprehensive. In particular, we did not cite any major work on content-based pub-sub. We do intend to compile a comprehensive list of citations for the full version of this paper. This is an additional reason why feedback from the DEBS community would be so useful.

What is a solid evaluation methodology?

Simulation and Evaluation (last edited 2009-09-15 21:49:12 by localhost)