Weekly Conference Call

6/09/2014

Gina's Barley genotype calls, continuous values 0 to 2 - store in a new column. Drop the old Illumina raw score data. Should we combine the alleles table with the genotype_data table?
Eduard's GBS genotype data - use 128 base pairs
update INSTALL document with new R package.

User group conference call - bug in download page, should allow phenotype, genotype, or both. In experiment design should the select trial give you a list of lines?

Lab research update - should add a note to the download page that describes why the filter settings on the production server need to be higher

6/02/2014

Big Projects for T3

1. predefined data sets including filtered lines and markers. Used to access data from published studies. Do analysis tools use this, or only download?
2. single login using Google account authentication
3. imputation of marker data
4. Automated validation -- T3 should perform checks on newly uploaded data to identify possible errors (phenotypic outliers, unlikely marker scores based on local haplotypes, ...).
5. Diversity panel selection -- T3 should be able to perform an analysis that identifies the N most genetically diverse lines, based on marker alleles, in the currently selected set of lines.
6. accelerate genotype data retrieval (HDF5? using byte storage and compression similar to TASSEL) consider data growth
7. provide option to download genotype data by experiment not consensus. email results of slow query. Do users need analysis of data? (maybe use predefined data set)
8. manage lists - a page to view/edit/save/name selections

9. external access to data and tools (common API)
structured data format on web pages (RDF) send journal from EBI, present idea to users
10. jBrowse, gBrowse
11. links to external websites
12. using iPlant eXceed super computer resources, TASSEL resources
13. Sandbox, plot level loading and analysis (playground) needs documentation. provide sandbox on iPlant to preserve user changes.
14. Field book integration
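
For item 5 (diversity panel selection), one possible approach is a greedy max-min selection over pairwise marker distances. The sketch below is only an illustration under stated assumptions: genotype calls are coded 0/1/2 with None for missing, and the names marker_distance and select_diverse are hypothetical, not existing T3 functions. The greedy rule is a heuristic and is not guaranteed to find the optimal set.

```python
from itertools import combinations

def marker_distance(a, b):
    """Mean absolute difference over markers scored in both lines (assumed measure)."""
    pairs = [(x, y) for x, y in zip(a, b) if x is not None and y is not None]
    if not pairs:
        return 0.0
    return sum(abs(x - y) for x, y in pairs) / len(pairs)

def select_diverse(genotypes, n):
    """Pick n lines from {line_name: [calls]} by greedy max-min distance."""
    names = sorted(genotypes)
    if n <= 0:
        return []
    if n >= len(names):
        return names
    if n == 1:
        return names[:1]
    # Precompute all pairwise distances once.
    dist = {
        frozenset(p): marker_distance(genotypes[p[0]], genotypes[p[1]])
        for p in combinations(names, 2)
    }
    # Seed with the most distant pair of lines.
    chosen = list(max(combinations(names, 2), key=lambda p: dist[frozenset(p)]))
    # Repeatedly add the line whose nearest chosen neighbour is farthest away.
    while len(chosen) < n:
        remaining = [x for x in names if x not in chosen]
        best = max(
            remaining,
            key=lambda x: min(dist[frozenset((x, c))] for c in chosen),
        )
        chosen.append(best)
    return chosen

example = {
    "A": [0, 0, 0, 0],
    "B": [0, 0, 0, 1],
    "C": [2, 2, 2, 2],
    "D": [1, 1, 1, 1],
}
print(select_diverse(example, 3))
```

With the currently selected set of lines this computes every pairwise distance, so for large selections it would probably need to run as a background job.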

development time, priority

June 9th User group agenda
allele conflicts update - finding and deleting bad data
download page - option for only phenotype download
experiment design - describe agricolae, integration with field book
GBS naming, loading

June 2nd agenda
Barley GBS project

tcap oat server - can mysql/apache run multiple copies?

template files should be in sync with template page.

5/27/2014

follow up with Katmandoo and Seattle API
- start work with http://docs.breeding.apiary.io/

follow up with Trevor and Field Book development
- new on github https://github.com/trife/Field-Book
- new SQLite structure?, connection with T3 and Cassavabase
1) login API using google for single signon
2) drop box or other for upload and backup data
3) connect to T3 for traits and layout

next user group conference call - June 9th?

experiment design page
- converted layout of madii design to t3 format
- for madii the check column needs to be changed from line number to boolean
- for madii the order of check lines determines if primary or secondary
- wait for new version of madii script

GBS data
- Marker name: contig3917765_1al-5481 or IWGSC2012_3917765_1al-5481
For the different versions of some chromosome-arms: IWGSC2012_4as_v2-1185913
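
A rough sketch of how the marker name forms above could be recognized when loading. It assumes the "contig" and "IWGSC2012_" spellings name the same marker, and it does not interpret the number after the hyphen, which is simply kept as "trailing". The function names and the chromosome-arm pattern are assumptions for illustration.

```python
import re

# contig3917765_1al-5481  or  IWGSC2012_3917765_1al-5481
_CONTIG_FORM = re.compile(
    r"^(?:contig|IWGSC2012_)(?P<contig>\d+)_(?P<arm>\d[a-z]{1,2})-(?P<trailing>\d+)$"
)
# IWGSC2012_4as_v2-1185913 (chromosome arm with an assembly version)
_VERSIONED_FORM = re.compile(
    r"^IWGSC2012_(?P<arm>\d[a-z]{1,2})_(?P<version>v\d+)-(?P<trailing>\d+)$"
)

def parse_marker_name(name):
    """Return the parts of a GBS marker name, or None if the form is unknown."""
    m = _CONTIG_FORM.match(name)
    if m:
        return {"contig": m.group("contig"), "arm": m.group("arm"),
                "version": None, "trailing": m.group("trailing")}
    m = _VERSIONED_FORM.match(name)
    if m:
        return {"contig": None, "arm": m.group("arm"),
                "version": m.group("version"), "trailing": m.group("trailing")}
    return None

def canonical_name(name):
    """Rewrite the 'contig' spelling to the IWGSC2012_ spelling; leave others as-is."""
    m = _CONTIG_FORM.match(name)
    if m:
        return "IWGSC2012_{contig}_{arm}-{trailing}".format(**m.groupdict())
    return name

print(canonical_name("contig3917765_1al-5481"))
print(parse_marker_name("IWGSC2012_4as_v2-1185913"))
```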

Big Projects meeting
- Clay send out announcement and request for topics

PAG Asia

5/19/2014

report from crop database API workshop (Dave M)
Genome Back Office - sharing genotyping data (JL, Ed Buckler, Susan McCouch)
git site - https://github.com/plantbreeding
http://docs.breeding.apiary.io/
http://www.ebi.ac.uk/rdf/documentation/uris-ebi-data
SSL configuration of email and logon (Dave H)
experiment design page (Clay)
row/column not correct
Iowa State University (Lawrence Lab) national group of maize researchers evaluating T3

5/12/2014

- request for database dump from evogene
should remove user table, add link from documentation page

- marker panels for PVP application (Vic)
we have "my Marker Panels" working

- Mark has assigned codes for proprietary lines in Cornell Master
private lines coded as NYCNL##, Vic will load malt machine first for checking

- reformatting marker sequences for synthetic (Vic)
new web script to translate AB into ACTG before loading into db

- API for T3
wheatplus/api

- On GWAS page made changes to label of result exports. Need to review with JL and others if the page is clear

- Alleles for all lines - add genotype experiment column
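
The AB-to-ACTG translation noted above for the synthetic marker sequences could look roughly like this. It is a sketch only: it assumes each marker comes with an allele pair such as "A/G" in which the first nucleotide is the A allele and the second is the B allele, and that missing calls are written "--". The function name translate_ab is hypothetical and is not the actual web script.

```python
MISSING = {"--", "", "NA"}  # assumed spellings of a missing call

def translate_ab(call, alleles):
    """Translate an AB genotype call (AA, AB, BA, BB) into nucleotides.

    alleles is a string like "A/G"; the first nucleotide is assumed to be
    the 'A' allele and the second the 'B' allele.
    """
    a_nuc, b_nuc = alleles.strip().upper().split("/")
    for nuc in (a_nuc, b_nuc):
        if nuc not in ("A", "C", "G", "T"):
            raise ValueError("unexpected allele: %r" % nuc)
    call = call.strip().upper()
    if call in MISSING:
        return "--"
    mapping = {"A": a_nuc, "B": b_nuc}
    if len(call) != 2 or any(c not in mapping for c in call):
        raise ValueError("unexpected genotype call: %r" % call)
    return "".join(mapping[c] for c in call)

print(translate_ab("AB", "A/G"))
print(translate_ab("BB", "T/C"))
```

Raising an error on an unexpected call, rather than passing it through, keeps bad scores from reaching the db unnoticed.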

5/5/2014

- Do we need to alphabetize alleles for Synop GBS data? (Vic)
- How to set an existing phenotype data point to 'missing'? (David M)
- GWAS
- added option for variance calculation method (Clay)
- interpreting the Validation plot (Vic)
- Download page - added selection option for genotype data download (Clay)
- User group meeting May 12th (Trial Design demo, Clay)
   add dialog box for upload and check lines
   add list of traits measured
   add links to android field book
   add links to phenotype import
- New template for phenotype data from multiple trials per file (DaveM)

- Methods section for manuscript (Vic)
- phenotypes for replicated checks not stored uniquely, can't edit (DaveM)
- HWW = "Hard Red Winter Wheat" [Eduard]? "Hard White Wheat"? (DaveM)

4/28/2014

T3 modules
    Presentation-abstraction-control (PAC), model-view-control (MVC)
    functional programming
    modular programming - technique that emphasizes separating the functionality of a program into independent, interchangeable modules
    Dave M will meet with Gates foundation to see if there is common ground between groups
    WebEx with developers of Katmandoo

API
    cropontology.org/api
    semantic web
    linked data

Changing variety names of lines in T3 that are also in GRIN to lab-specific code names.
     If we do this (i.e. create new unique names for those 17 lines with WB1
     through WB17 as primary T3 names), we are going to have to edit and
     re-upload all four of the phenotype trials and genotype data for the LRpanel.
     Adult Leaf Rust Response
     This request is to recode the line names to use experiment specific names (this is relatively easy)

Download page, genotype data optional
add check box to select genotype data
on map selection page find way to make page faster or background computation optional

Compare trials page, plots for more than 2 trials

marker import file for Eduard's GBS data

interchange between DArTdb, Katmandoo, T3
    DArTdb - internal LIMS at DArT
    KDDart - data storage and integration platform hosted by DArT
    Katmandoo - database and client software (trial management, pedigree (beta), windows mobile (beta), crossing tool, molecular marker, inventory)
    Fieldscorer 4 Android - collecting trait data in field

4/21/2014

GBS

Cornell Master - check with Lynn to code Pioneer lines

Eduard GBS - save assembly version

Download page - make genotype download optional

POPSEQ data - more wheat large data coming from Jessie, do not save imputed data

Anchoring and ordering NGS contig assemblies by population sequencing (POPSEQ)

Analysis

Karen - current page compares two trials, need to compare check/controls across all trials, locations, years

Collaboration

discuss with Lee Hickey how to get all data into one database

North American Barley Researchers Workshop, June 29 to July 2, 2014, University of Minnesota
Kevin Smith, Karen Beaubien
July 2 14:00-17:00 Special Workshop on "Big Data" at Science Teaching & Student Services, 222 Pleasant Street SE
