Intranet Tools

nb. next round of REF2013 will NOT be using data from eprints.ecs, but the central university REF interface.

RSS 1.0 Feed
RSS 2.0 Feed
Atom Feed
 

Benchmarking Workflow Discovery: A Case Study From Bioinformatics

Goderis, A., Fisher, P., Gibson, A., Tanoh, F., Wolstencroft, K., De Roure, D. and Goble, C. (2009) Benchmarking Workflow Discovery: A Case Study From Bioinformatics. Concurrency: Practice and Experience . ISSN 1040-3108 (In Press)

Download

[img]
Preview
PDF
423Kb

Abstract

Automation in science is increasingly marked by the use of workflow technology. The sharing of workflows through repositories supports the verifability, reproducibility and extensibility of computational experiments. However, the subsequent discovery of workflows remains a challenge, both from a sociological and technological viewpoint. Based on a survey with participants from 19 laboratories, we investigate current practices in workflow sharing, re-use and discovery amongst life scientists chiefly using the Taverna workflow management system. To address their perceived lack of effective workflow discovery tools, we go on to develop benchmarks for the evaluation of discovery tools, drawing on a series of practical exercises. We demonstrate the value of the benchmarks on two tools: one using graph matching, the other relying on text clustering.

Item Type:Article
Creator/Authors:
Antoon Goderis
Paul Fisher
Andrew Gibson
Franck Tanoh
Katy Wolstencroft
David De Roure
Carole Goble
Keywords:Scientific Workflow, Bioinformatics, Discovery, Benchmark, Taverna, myExperiment
Research Group:Old ECS Groups > Intelligence, Agents, Multimedia
ISSN:1040-3108
Date:15 February 2009
Information about this record:
Performance Indicator:EZ~07~01~11
Citations:ISI: 3, Google Scholar: 7
Downloads (2010):146
ID Code:17107
Last Modified:23 Sep 2011 10:37
Deposited On:14 Feb 2009 20:08 by De Roure, David

Tools & Metadata

Download Statistics

Last month

Last year

Members of ECS may view the download statistics dashboard for this record.

References in Article

Select the SEEK icon to attempt to find the referenced article. If it does not appear to be in this archive you will be forwarded to the paracite service. Poorly formated references will probably not work.

1. Daniela Berardi, Giuseppe De Giacomo, Maurizio Lenzerini, Massimo Mecella, and Diego Calvanese.

Synthesis of underspecified composite e-services based on automated reasoning. In 2nd International

Conference on Service Oriented Computing ICSOC, pages 105-114. ACM Press, 2004.

2. A. Bernstein, E. Kaufmann, C. Brki, and M. Klein. How similar is it? Towards personalized similarity

measures in ontologies. In 7 Internationale Tagung Wirtschaftsinformatik, February 2005.

3. Abraham Bernstein and Mark Klein. Towards high-precision service retrieval. In Proceedings of the First

International Semantic Web Conference (ISWC), Sardinia, Italy, 2002. Springer.

4. J. C. Corrales, D. Grigori, and M. Bouzeghoub. BPEL Processes Matchmaking for Service Discovery.

In Conference on Cooperative Information Systems (COOPIS), LNCS 4275, pages 237-254, Montpellier,

France, 2006.

5. X. Dong, A. Halevy, J. Madhavan, E. Nemes, and J. Zhang. Similarity search for web services. In Proc.

of the 30th VLDB Conference, Toronto, Canada, 2004.

6. Schahram Dustdar and Wolfgang Schreiner. A survey on Web services composition. Int. J. Web and

Grid Services, 1(1), 2005.

7. Yolanda Gil, Ewa Deelman, Mark Ellisman, Thomas Fahringer, Geoffrey Fox, Dennis Gannon, Carole

Goble, Miron Livny, Luc Moreau, and Jim Myers. Examining the challenges of scientific workflows.

Computer, 40(12):24-32, December 2007.

8. Antoon Goderis. Workflow re-use and discovery in bioinformatics. PhD thesis, School of Computer

Science, The University of Manchester, 2008.

9. Antoon Goderis, Christopher Brooks, Ilkay Altintas, Edward A. Lee, and Carole Goble. Heterogeneous

Composition of Models of Computation. Future Generation Computer Systems (FGCS), Accepted for

publication.

10. Antoon Goderis, Peter Li, and Carole Goble. Work°ow discovery: requirements from e-science and a

graph-based solution. International Journal of Web Services Research (JWSR), 5(4), 2008.

11. R. L. Goldstone. MIT encyclopedia of the cognitive sciences, chapter Similarity, pages 757-759. MIT

Press, Cambridge, MA, 2001.

12. Nikiforos Karamanis. Entity Coherence for Descriptive Text Structuring. Phd thesis, School of

Informatics, University of Edinburgh, 2003.

13. Christoph Kiefer, Abraham Bernstein, Hong Joo Lee, Mark Klein, and Markus Stocker. Semantic process

retrieval with iSPARQL. In European Semantic Web Conference (ESWC), pages 609-623, 2007.

14. J. Kim, Y. Gil, and V. Ratnakar. Semantic metadata generation for large scientific workflows. In Int.

Semantic Web Conference (ISWC), Athens, USA, November 5-9 2006.

15. Charles W. Krueger. Software reuse. ACM Comput. Surv., 24(2), 1992.

16. Bendick Mahleko and Andreas Wombacher. Indexing business processes based on annotated finite state

automata. In ICWS, pages 303-311, 2006.

17. Brahim Medjahed, Athman Bouguettaya, and Ahmed K. Elmagarmid. Composing Web services on the

Semantic Web. VLDB J., 12(4), 2003.

18. D. Miers, P. Harmon, and C. Hall. The 2007 BPM suites report. http://www.bptrends.com.

19. Tom Oinn, Mark Greenwood, Matthew Addis, Nedim Alpdemir, Justin Ferris, Kevin Glover, Carole Goble,

Antoon Goderis, Duncan Hull, Darren Marvin, Peter Li, Phillip Lord, Matthew Pocock, Martin Senger,

Robert Stevens, Anil Wipat, and Chris Wroe. Taverna: Lessons in creating a workflow environment for

the life sciences. Concurrency and Computation: Practice and Experience: Special Issue on Scientific

Workflows, 18(10):1067-1100, 2005.

20. E. Reiter and S. Sripada. Should corpora texts be gold standards for NLG? In INLG, pages 97{104, New

York, USA, 2002. Harriman.

21. David De Roure, Carole Goble, and Robert Stevens. Designing the myExperiment Virtual Research

Environment for the Social Sharing of Workflows. In Third IEEE International Conference on e-Science

and Grid Computing, pages 603-610, Bangalore, India, December 10-13 2007.

22. Carlos E. Scheidegger, Huy T. Vo, David Koop, Juliana Freire, and Claudio T. Silva. Querying and

creating visualizations by analogy. IEEE Trans. Vis. Comp. Graph., 13(6):1560-1567, 2007.

23. S. Siegel and J. N. Castellan. Nonparametric Statistics for the Behavioral Sciences. McGraw-Hill, 1988.

24. Ioan Toma, Kashif Iqbal, Dumitru Roman, Thomas Strang, Dieter Fensel, Brahmananda Sapkota,

Matthew Moran, and Juan Miguel Gomez. Discovery in Grid and Web services environments: A survey

and evaluation. Multiagent and Grid Systems Special Issue on Advances in Grid services Engineering

and Management, 3(3):341-352, 2007.

25. A. Wombacher. Evaluation of technical measures for work°ow similarity based on a pilot study. In

CoopIS, Montpellier, France, November 1-3 2006.

26. C. Wroe, R. Stevens, C. Goble, A. Roberts, and M. Greenwood. A suite of DAML+OIL ontologies to

describe bioinformatics web services and data. Intl. J. of Cooperative Information Systems, 12(2):197-224,

2003.

27. Chris Wroe, Carole Goble, Antoon Goderis, Phillip Lord, Simon Miles, Juri Papay, Pinar Alper, and Luc

Moreau. Recycling workflows and services through discovery and reuse. Concurrency and Computation:

Practice and Experience, 19(2):181-194, 2007.

28. Jun Zhao, Carole Goble, Robert Stevens, and Daniele Turi. Mining Taverna's semantic web of provenance.

Concurrency and Computation: Practice and Experience, 20(5):463{472, 2008.

29. Y. Zhao, M. Wilde, and I. Foster. Applying the virtual data provenance model. In Int. Provenance and

Annotation Workshop (IPAW), Chicago, USA, May 3-5 2006.

Corrections

ECS staff and postgraduates may modify this record

  Welcome from Deputy Head of School (Research) Research Prospectus Industrial Partnerships New Research Students Notes for Guidance New Research Students Notes for Guidance
The ECS EPrints Repository supports OAI 2.0 with a base URL of http://eprints.ecs.soton.ac.uk/cgi/oai2

EPrints is free software developed by the University of Southampton to facilitate Open Access to research.
EPrints