Groth, P., Miles, S., Fang, W., Wong, S. C., Zauner, K. P. and Moreau, L. (2005) Recording and Using Provenance in a Protein Compressibility Experiment. In: The 14th IEEE International Symposium on High Performance Distributed Computing (HPDC-14), 24-27 July, 2005, Research Triangle Park, North Carolina.
Download
|
PDF
162Kb |
Abstract
Very large scale computations are now becoming routinely
used as a methodology to undertake scientific research.
In this context, ‘provenance systems’ are regarded
as the equivalent of the scientist’s logbook for in silico experimentation:
provenance captures the documentation of
the process that led to some result. Using a protein compressibility
analysis application, we derive a set of generic
use cases for a provenance system. In order to support
these, we address the following fundamental questions:
what is provenance? how to record it? what is the performance
impact for grid execution? what is the performance
of reasoning? In doing so, we define a technologyindependent
notion of provenance that captures interactions
between components, internal component information and
grouping of interactions, so as to allow us to analyse and
reason about the execution of scientific processes. In order
to support persistent provenance in heterogeneous applications,
we introduce a separate provenance store, in
which provenance documentation can be stored, archived
and queried independently of the technology used to run the
application. Through a series of practical tests, we evaluate
the performance impact of such a provenance system. In
summary, we demonstrate that provenance recording overhead
of our prototype system remains under 10% of execution
time, and we show that the recorded information successfully
supports our use cases in a performant manner.
| Item Type: | Conference or Workshop Item | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Creator/Authors: |
| ||||||||||||
| Keywords: | Provenance, Grid, protein compressibility | ||||||||||||
| Research Group: | Old ECS Groups > Science and Engineering of Natural Systems Old ECS Groups > BIO@ECS Research Group Current ECS Groups > Web and Internet Science Old ECS Groups > Intelligence, Agents, Multimedia Current ECS Groups > Agents, Interaction and Complexity | ||||||||||||
| Date: | 2005 | ||||||||||||
| Information about this record: | |||||||||||||
| Performance Indicator: | EZ~06~06~04 | ||||||||||||
| Citations: | ISI: 1, Google Scholar: 56 | ||||||||||||
| Downloads (2010): | 18 | ||||||||||||
| ID Code: | 10910 | ||||||||||||
| Last Modified: | 23 Sep 2011 10:32 | ||||||||||||
| Deposited On: | 24 May 2005 by Groth, Paul | ||||||||||||
Tools & Metadata
Download Statistics
Members of ECS may view the download statistics dashboard for this record.
References in Article
Select the SEEK icon to attempt to find the referenced article. If it does not appear to be in this archive you will be forwarded to the paracite service. Poorly formated references will probably not work.
Corrections
ECS staff and postgraduates may modify this record









