NYCCS/Computer Science Seminar
Tevfik Kosar
Louisiana State University
Thursday, April 22, 2010 2:15PM
Location: Computer Science Building, Room 2311
Parallel Computing: Data-Aware Distributed Computing: Enabling Large-Scale Collaborative Science
Abstract:
Applications and experiments in all areas of science are becoming increasingly complex and more demanding in terms of their computational and data requirements. Some applications generate data volumes reaching hundreds of terabytes. Sharing, disseminating, and analyzing these large data sets becomes a big challenge, especially when distributed resources are used. The traditional distributed computing systems closely couple data handling and computation. They consider data resources as second class entities, and access to data as a side effect of computation. This makes the remote access and retrieval of data the main bottleneck in the end-to-end performance, reliability and automation of large-scale data-intensive and dynamic data-driven applications. The inadequacy of traditional distributed computing systems in dealing with complex data handling problem in our new data-rich world has motivated me to create a new computing paradigm called data-aware distributed computing. In my talk, I will present this new computing paradigm and examples of our work in this area so far.