[dataone] BigProv Meeting, Summer School / NASA internship opportunities, DMPTool developer position

Wed Feb 6 15:23:56 PST 2013

Dear DataONE Community

Please find below three items that may be of interest:

   1. BIGPROV'13 Call for Participation
   2. The NASA-University of Virginia Intensive Summer School in Computing
   for Environmental Sciences
   3. DMPTool Software Engineer position at California Digital Library

Amber

************************************
*1. BIGPROV'13 Call for Participation*

BIGPROV'13: International Workshop on Managing and Querying Provenance Data
at Scale
http://sites.google.com/site/bigprov13/
bigprov13 at easychair.org

In conjunction with EDBT/ICDT 2013: http://edbticdt2013.disi.unige.it/
March 22nd, 2013, Genova, Italy *

*Confirmed Keynote speakers*
Dr. Joerg Kraenzlein, MD, PhD -- Medical Director, Big Data, iSOFT Health
GmbH, a CSC company
Dr. Grigoris Karvounarakis, LogicBlox, USA
*
*
*BigProv Research Papers*
Paraskevi Zerva, Steffen Zschaler and Simon Miles. Towards Design Support
for Provenance Awareness: A Classification of Provenance Questions
Pinar Alper, Khalid Belhajjame, Carole A. Goble and Pinar Karagoz.
Enhancing and Abstracting Scientific Workflow Provenance for Data
Publishing
Devarshi Ghoshal and Beth Plale. Provenance from Log Files: a BigData
Problem
Alessandro Spinuso, James Cheney and Malcolm Atkinson. Provenance for
seismological processing pipelines in a distributed streaming workflow
Flavio Costa, Victor Sousa, Daniel de Oliveira, Kary Ocana, Eduardo
Ogasawara, Jonas Dias and Marta Mattoso. Capturing and Querying Workflow
Runtime Provenance with PROV: a Practical Approach
Edoardo Pignotti, Gary Polhill and Peter Edwards. Using Provenance to
Analyse Agent-based Simulations
Bernd Amann, Camelia Constantin, Clement Caron and Patrick Giroux. WebLab
PROV: Computing fine-grained provenance links for XML artifacts

*ProvBench (Short Presentations)*
David Koop, Juliana Freire, Claudio Silva, VisTrails Provenance Traces for
Benchmarking Fernando Chirigati,
Provenance Traces of the Swift Parallel Scripting System, Luiz M. R.
Gadelha Jr., Michael Wilde, Marta Mattoso, Ian Foster
Paolo Missier and Ziyu Chen. Extracting PROV provenance traces from
Wikipedia history pages
Edoardo Pignotti, Gary Polhill, Peter Edward, PROV-O Provenance Traces From
Agent-based Social Simulation
Ashish Gehani, Dawood Tariq, Cross-Platform Provenance
Felipe Horta, Vitor Silva, Flavio Costa, Daniel de Oliveira, Kary Ocana,
Eduardo Ogasawara, Jonas Dias, Marta Mattoso, Provenance Traces from Chiron
Parallel Workflow Engine
Heiko Mueller, Chris Peters, Yanfeng Shu, Andrew Terhorst, Provenance in
Streamflow Forecastinga
Khalid Belhajjame, Jun Zhao, Daniel Garijo, Aleix Garrido, Stian
Soiland-Reyes and Pinar Alper. The Taverna and Wings Workflow PROV-Corpus

*WORKSHOP FOCUS*
Provenance data is poised to become pervasive in key areas of information
management, ranging from traditional areas of science (i.e., life sciences,
earth sciences, astronomy, etc.), to new applications enabled by the Web
(e.g., social sciences, social network analysis, quality and trust in Web
publishing).

As the volume of provenance metadata increases with the volume of the
underlying data whose history it describes, new challenges for managing and
querying provenance at scale emerge, i.e., provenance data is growing in
both "count" and "complexity". It is growing in count because of the very
large number of provenance traces (one for each Twitter message, for
example), and in complexity in the case of  provenance graphs that are
generated from provenance-enabled programming environments (e.g.,
scientific workflow systems) and middleware. Data-intensive science is
bound to produce provenance that fares high on both accounts.

At the same time, emerging standards such as PROV, the W3C recommendation
for provenance modelling and Web-based access, suggest that provenance data
will increasingly be encoded using Semantic Web technology. This in turn
suggests that provenance data will soon form a natural extension of, and
seamlessly blend with, the growing Linked Data Cloud.

The new Managing and Querying Provenance Data at Scale workshop (BIGProv)
stems from these premises. We are interested in exploring the system and
modelling challenges associated with collecting, storing, querying, and
exploiting large volumes of possibly complex provenance data. We seek to
map the state of the art, elicit new research problems, and learn about
existing systems. More specifically, the workshop scope includes the
following topics:

- Automated capture of provenance at multiple layers (system, middleware,
applications)
- Database models, languages, and systems for storing and querying
large-scale provenance
- Provenance and Linked Open Data (LOD): seamless representation and query
models
- Comparison and performance benchmarking of different data architectures
and query models for provenance
- Analysis of existing graph query models and systems for provenance graphs
- Reference datasets for provenance benchmarking
- System descriptions and demonstrations of large-scale provenance and
graph data
- Uniform querying over heterogeneous provenance traces
- Abstraction models for provenance and their applications to user
presentation, visualization, and privacy preservation

*PUBLICATION*
Accepted papers are included in the ACM DL as well as in the official EDBT
workshop proceedings.
*
Workshop Organizers*
Bertram Ludaescher, UC Davis, CA (ludaesch at ucdavis.edu)
Paolo Missier, Newcastle University, UK (pmissier at acm.org)

Proceedings chair:  Victor Cuevas, University of New Mexico and UC Davis,
USA

*
*************************************
*2. The NASA-University of Virginia Intensive Summer School in Computing
for Environmental Sciences*

ISSCENS
The University of Virginia Alliance for Computational Science & Engineering
(UVACSE), in cooperation with the Department of Environmental Science and
NASA, is sponsoring

The NASA-University of Virginia
Intensive Summer School in Computing for Environmental Sciences

What is ISSCENS?

The Intensive Summer School for Computing in Environmental Sciences
(ISSCENS) is an exciting opportunity for students interested in working
with computational applications for atmospheric, ocean, climate, and other
environmental sciences.  Students will acquire hands-on computing
experience and basic training in software engineering and high-performance
computing.

The program is open to graduate students and to undergraduates in their
last year.  Motivated third-year students who plan to go to graduate school
are also encouraged to apply.  A total of 20 students will be selected for
the Summer School, of which 10 will go on to 8-week internships at NASA
centers. No previous programming experience is required as we will teach it
starting from the basics, but students should be interested in computing
and intend to use it in their future research.

Students selected to attend the Summer School may be foreign nationals, but
only US citizens can be NASA interns.

*Applications*
Please complete our  application form (
http://avillage.web.virginia.edu/NASAApp)

*Required Application Materials*
Unofficial transcripts (we will ask for information about specific courses)
Two letters of reference
Statement of intent: the applicant should write a short explanation of how
the Summer School and Internship will fit into his or her personal career
goals

*Dates for Summer 2013*
Summer School:  May 29-June 14
Internships: June 17-August 8

*Application Deadline*
Submit Internship Application for Summer 2013 Session by
March 15, 2013

http://www.uvacse.virginia.edu/isscens/

************************************
*3. DMPTool Software Engineer position at California Digital Library*

California Digital Library is recruiting a software engineer for a one year
development project to enhance the DMPTool (http://dmptool.org/) service
supported by the UC Curation Center (UC3). The DMPTool is a widely used
service that supports University faculty and researchers in creating data
management plans as required by various governmental and private funding
agencies. The development project will augment the existing tool with a
number of new functions and features. Reporting to the DMPTool project
manager, the incumbent will be responsible for refining functional
requirements, UI designs, and technical specifications; implementation of
those specifications; testing and documenting the resulting codebase; and
deployment of the operational system in a production environment. UC3
employs an agile development methodology relying on iterative code
prototyping, assessment, and refinement.

The DMPTool is a Ruby on Rails web application with a MySQL backend
database and is integrated with LDAP- and Shibbolith-based authentication.
It is deployed in a SLES/SUSE Linux VM environment. Candidates will have
demonstrated experience and expertise in these and related web
technologies, as well as in general software development methodologies and
best practices.

For more information and to apply visit:
https://jobs.ucop.edu/applicants/jsp/shared/frameset/Frameset.jsp?time=1360192332140

-- 
Amber E Budden, PhD
Director for Community Engagement and Outreach
DataONE
University of New Mexico
1312 Basehart SE
Albuquerque, NM 87106

Tel: 505-814-1112
Cell: 505-205-7675
Fax: 505-246-6007
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.dataone.org/pipermail/community/attachments/20130206/5e3a9954/attachment.html>