Significant scientific advances are increasingly achieved through complex sets of computations and data analyses. These computations, often represented as workflows of executable jobs and associated data flows, may comprise thousands of steps. Each step may integrate diverse models and data sources, which may be developed by different groups. The applications and data may also be distributed across the execution environment. The assembly and management of such workflows present many challenges, and increasingly ambitious scientific inquiry is continuously pushing the limits of current technology.

Today’s workflow systems are able to manage quite complex computations that include thousands of components, use dozens of data repositories, and harness resources at dozens of sites. However, these applications are structurally simple compared with emerging requirements from scientists: handling streaming data, accommodating interactive steering, supporting event-driven analysis, and enabling workflows to be created through collaborative design processes involving many scientists across disciplines.

To examine the nature of these challenges and to consider what steps should be taken to address them, a Workshop on the Challenges of Scientific Workflows was held at the National Science Foundation on May 1-2, 2006. The meeting brought together domain scientists, computer scientists, and social scientists to discuss requirements of future scientific applications and the challenges that they present to current workflow technologies.

  • Final Workshop Report (PDF)
  • Executive Summary (PDF)
  • Informational Flyer about the workshop (PDF)
  • Final Presentation Slides (PDF)

NEW: Yolanda Gil, Ewa Deelman, Mark Ellisman, Thomas Fahringer, Geoffrey Fox, Dennis Gannon, Carole Goble, Miron Livny, Luc Moreau, Jim Myers, "Examining the Challenges of Scientific Workflows," Computer, vol. 40, no. 12, pp. 24-32, December 2007.

Available from the publisher here
Available in its original form here

This workshop was funded by the NSF through the CISE Information and Intelligent Systems Division (IIS), under Program Director Maria Zemankova, Ph.D., grant number IIS-0629361.

A description of the NSF award can be found here

