We’re excited to announce a workshop coming November 16-17 on using NCBI data & tools in the Cloud!
The National Center for Biotechnology Information (NCBI) has created some of our favorite hits, such as the Sequence Read Archive (SRA) and BLAST®, both of which are indispensable to the computational biology community. Did you know that these data and services can now be scaled up to fit the needs of researchers who may not have access to enough compute power, storage, and time for analysis on their own resources?
NCBI will lead a 2-day workshop introducing new cloud-ready tools. These tools include the Prokaryotic Genome Annotation Pipeline (PGAP), ElasticBLAST for scaling up bulk BLAST alignment tasks for large-sized queries, and GCP’s BigQuery for SRA data to bring the power of structured queries to SRA data archives. As the massive amount of biological data we are collectively generating increases, cloud services provide possible solutions for bringing code and compute power to the data, rather than the other way around. We will talk over practical costs and logistics of using cloud platforms, as well as walk through some examples of these tools so you can see them in action. Demonstrations, hands-on exercises, and plenty of ‘office hours’ time to connect with the developers and team leads from NCBI will allow you ample opportunity to find out how to utilize the cloud to boost your research. This is an entirely online experience, but registration is required beforehand so that we can prepare user accounts.
Mon. Nov 16th: 10 AM Eastern – 5 PM Eastern
- Introduction to the cloud, including how to get started
- Cloud-based searching & analysis, including
- Prokaryotic Genome Annotation Pipeline (PGAP) for bacterial and archaeal genomes
- ElasticBLAST for scaling bulk BLAST alignment tasks
- BigQuery for searching SRA metadata
Tues. Nov 17th: 10 AM Eastern – 4 PM Eastern
- Practice with independent activities
- NCBI ‘office hours:’ 1-on-1 discussions about your questions & projects
Participants will be selected for this 30-participant workshop (maximum) based on a first come, first served basis combined with their knowledge of command line/SQL, research interests, and commitment to attend the workshop. If you sense a potential conflict with being able to attend this workshop, please let us know so that we can enable other (waitlisted) participants to benefit from this workshop.
Registration link: https://nih.webex.com/nih/k2/j.php?MTID=t985d3f34e143ed7bd937732e63363a97