M. N. Alpdemir, A. Mukherjee, N. W. Paton, A. A. Fernandes, P. Watson et al., Contextualised workflow execution in myGrid, European Grid Conference, Lecture Notes in Computer Science, vol.3470, 2005.

S. F. Altschul, T. L. Madden, A. A. Schäffer, J. Zhang, Z. Zhang et al., Gapped BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Research, vol.25, issue.17, 1997.
DOI : 10.1093/nar/25.17.3389

Apache Web Server (Version 2.2.22)

D. Bhagwat, L. Chiticariu, W. Tan, and G. Vijayvargiya, An annotation management system for relational databases, 30th ACM International Conference on Very Large Data Bases, 2004.

R. Bose and J. Frew, Lineage retrieval for scientific data processing: a survey, ACM Computing Surveys, vol.37, issue.1, 2005.
DOI : 10.1145/1057977.1057978

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.103.4855

S. Callahan, J. Freire, E. Santos, C. Scheidegger, C. Silva et al., VisTrails, Proceedings of the 2006 ACM SIGMOD international conference on Management of data, SIGMOD '06, 2006.
DOI : 10.1145/1142473.1142574

F. Chang, J. Dean, S. Ghemawat, W. Hsieh, D. Wallach et al., Bigtable, 7th USENIX Symposium on Operating Systems Design and Implementation, 2006.
DOI : 10.1145/1365815.1365816

I. T. Foster, J. Vöckler, M. Wilde, and Y. Zhao, Chimera: A virtual data system for representing, querying, and automating data derivation, Scientific and Statistical Database Management Conference, 2002.

J. Frew and R. Bose, Earth System Science Workbench: A data management infrastructure for earth science products, Scientific and Statistical Database Management Conference, 2001.
DOI : 10.1109/ssdm.2001.938550

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.123.5040

J. Frew, D. Metzger, and P. Slaughter, Automatic capture and reconstruction of computational provenance, Concurrency and Computation: Practice and Experience, vol.20, issue.5, 2008.
DOI : 10.1002/cpe.1247

A. Gehani and U. Lindqvist, Bonsai: Balanced Lineage Authentication, Twenty-Third Annual Computer Security Applications Conference (ACSAC 2007), 2007.
DOI : 10.1109/ACSAC.2007.45

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.146.4276

A. Gehani, M. Kim, and J. Zhang, Steps toward managing lineage metadata in Grid clusters, 1st Workshop on the Theory and Practice of Provenance, 2009.

A. Gehani, M. Kim, and T. Malik, Efficient querying of distributed provenance stores, Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, HPDC '10, 2010.
DOI : 10.1145/1851476.1851567

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.190.1706

A. Gehani and M. Kim, Mendel, Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, HPDC '10, 2010.
DOI : 10.1145/1851476.1851503

A. Gehani, D. Tariq, B. Baig, and T. Malik, Policy-Based Integration of Provenance Metadata, 2011 IEEE International Symposium on Policies for Distributed Systems and Networks, 2011.
DOI : 10.1109/POLICY.2011.12

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.231.1923

B. Glavic and G. Alonso, Perm: Processing Provenance and Data on the Same Data Model through Query Rewriting, 2009 IEEE 25th International Conference on Data Engineering, 2009.
DOI : 10.1109/ICDE.2009.15

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.172.8566

P. Groth and L. Moreau, Representing distributed systems using the Open Provenance Model, Future Generation Computer Systems, vol.27, issue.6, 2011.
DOI : 10.1016/j.future.2010.10.001

A. Heydon, R. Levin, T. Mann, and Y. Yu, The Vesta Approach to Software Configuration Management, 2001.

T. Heinis and G. Alonso, Efficient lineage tracking for scientific workflows, Proceedings of the 2008 ACM SIGMOD international conference on Management of data, SIGMOD '08, 2008.
DOI : 10.1145/1376616.1376716

D. A. Holland, U. Braun, D. Maclean, K. Muniswamy-Reddy, and M. Seltzer, Choosing a data model and query language for provenance, 2008.

A. Kementsietsidis and M. Wang, On the Efficiency of Provenance Queries, 2009 IEEE 25th International Conference on Data Engineering, 2009.
DOI : 10.1109/ICDE.2009.206

P. Macko and M. Seltzer, A general-purpose provenance library, 4th USENIX Workshop on the Theory and Practice of Provenance, 2012.

T. Malik, A. Gehani, D. Tariq, and F. Zaffar, Sketching distributed data provenance, Data Provenance and Data Management for eScience, Lecture Notes in Computer Science, vol.7092, 2012.
DOI : 10.1109/escience.2010.51

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.206.5276

S. Miles, E. Deelman, P. Groth, K. Vahi, G. Mehta et al., Connecting Scientific Data to Scientific Experiments with Provenance, Third IEEE International Conference on e-Science and Grid Computing (e-Science 2007), 2007.
DOI : 10.1109/E-SCIENCE.2007.22

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.116.1455

L. Moreau, B. Clifford, J. Freire, J. Futrelle, Y. Gil et al., The Open Provenance Model core specification (v1.1), Future Generation Computer Systems, 2010.
DOI : 10.1016/j.future.2010.07.005

A. Rajgarhia and A. Gehani, Performance and extension of user space file systems, Proceedings of the 2010 ACM Symposium on Applied Computing, SAC '10, 2010.
DOI : 10.1145/1774088.1774130

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.158.7133

K. Muniswamy-Reddy, P. Macko, and M. Seltzer, Making a Cloud provenance-aware, 1st USENIX Workshop on the Theory and Practice of Provenance, 2009.

C. T. Silva, J. Freire, and S. Callahan, Provenance for Visualizations: Reproducibility and Beyond, Computing in Science & Engineering, vol.9, issue.5, 2007.
DOI : 10.1109/MCSE.2007.106

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.408.9911

M. Szomszor and L. Moreau, Recording and Reasoning over Data Provenance in Web and Grid Services, International Conference on Ontologies, Databases and Applications of Semantics, 2003.
DOI : 10.1007/978-3-540-39964-3_39

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.106.4373

D. Tariq, M. Ali, and A. Gehani, Towards Automated Collection of Application-Level Data Provenance, 4th USENIX Workshop on the Theory and Practice of Provenance, 2012.

J. Widom, Trio: A system for integrated management of data, accuracy and lineage, 2nd Conference on Innovative Data Systems Research, 2005.

J. Zhao, C. A. Goble, R. Stevens, and S. Bechhofer, Semantically Linking and Browsing Provenance Logs for E-science, 1st IFIP International Conference on Semantics of a Networked World, 2004.
DOI : 10.1007/978-3-540-30145-5_10

W. Zhou, M. Sherr, T. Tao, X. Li, B. Loo et al., Efficient querying and maintenance of network provenance at internet-scale, Proceedings of the 2010 international conference on Management of data, SIGMOD '10, 2010.
DOI : 10.1145/1807167.1807234

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.169.2250