Accessing the data

Post by jcbradley » Tue Sep 01, 2009 1:40 pm

Where are the data, processed or not, located on this site? What tools do you need to visualize them?
jcbradley
 
Posts: 1
Joined: Tue Sep 01, 2009 1:33 pm

Re: Accessing the data

Post by donpellegrino » Tue Sep 08, 2009 11:17 am

Thanks for the post. I am working on a new page for the site to document the methodology in terms of the data transformations. I will include links to all of the data involved in each experiment. I'll let you know when I have the new page on-line.

The source code for the visualization tool can be found in the source control management system at http://cluster.ischool.drexel.edu/~st96 ... 05/commit/. The page includes links to download the source code of the latest version in zip and tar formats. To make the tool easier for non-programmers to run, I am working on a distribution via a Firefox plug-in. More on that effort can be found in another post here in the forums (viewtopic.php?f=2&t=6).
donpellegrino
 
Posts: 14
Joined: Wed Aug 19, 2009 1:52 pm

Data Documentation

Post by donpellegrino » Tue Sep 08, 2009 1:21 pm

I have added a new page to the site that describes the data aspects of the project. It is online at http://cluster.cis.drexel.edu/~st96wym4/flumap/data/. The page documents the general data collection process for the sequence data. It also lists the available metadata for the sequence records. I plan to add additional pages for each run and experiment. Links to download specific sets of data could be added to those pages. Let me know what you think, and whether you would like to see specific sets made available in specific formats to support analyses.

Re: Accessing the data

Post by donpellegrino » Sat Jan 16, 2010 10:51 am

I have created a new code repository for exposing all of the original influenza data that is collected, along with the post-processed results, as a single HDF5 file. The new code repository, "exp007: Influenza Data Processing", is online at [http://cluster.ischool.drexel.edu/~st96wym4/flumap/cgit/cgit.cgi/exp007/]. The attached diagram is an export of the Dia file in that repository. It documents the NCBI sources and the aggregator that I am building to read the records and calculate the derived fields.
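
As a rough illustration of what an aggregator of this kind might do, the sketch below reads FASTA-formatted records and calculates two derived fields. It is not the exp007 code: the FASTA input format, the field names, the sample records, and the choice of sequence length and GC fraction as derived fields are all assumptions made for illustration. Writing the result to an HDF5 file is left out.

```python
from dataclasses import dataclass


@dataclass
class SequenceRecord:
    """One aggregated record (hypothetical layout, not the exp007 schema)."""
    accession: str      # identifier taken from the FASTA header
    description: str    # remainder of the header line
    sequence: str       # nucleotide sequence, uppercased
    length: int         # derived field: number of bases
    gc_fraction: float  # derived field: share of G and C bases


def parse_fasta(text):
    """Yield (header, sequence) pairs from FASTA-formatted text."""
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def aggregate(text):
    """Read every record and attach the derived fields."""
    records = []
    for header, seq in parse_fasta(text):
        accession, _, description = header.partition(" ")
        seq = seq.upper()
        gc_count = seq.count("G") + seq.count("C")
        gc_fraction = gc_count / len(seq) if seq else 0.0
        records.append(
            SequenceRecord(accession, description, seq, len(seq), gc_fraction)
        )
    return records


# Made-up sample input; these are not real NCBI accessions.
SAMPLE = """>EX000001 hypothetical example record A
ATGGCC
TTAA
>EX000002 hypothetical example record B
GGGCCC
"""

if __name__ == "__main__":
    for rec in aggregate(SAMPLE):
        print(rec.accession, rec.length, round(rec.gc_fraction, 2))
```

Keeping the original sequence next to the derived fields in each record mirrors the stated goal of exposing both the collected data and the post-processed results together.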
Attachments
Data Deployments.png (83.84 KiB)
Export of the Dia diagram "exp007/doc/Data Deployments.dia" on Jan. 16, 2010.

