2026 03 04 cookbook sequences by danielle-pinto · Pull Request #28 · BioJulia/BioTutorials

danielle-pinto · 2026-03-06T20:30:54Z

First cookbook tutorial that explains how to read in different file bioinformatics file types

danielle-pinto · 2026-03-06T20:31:25Z

cookbook/assets/mecA.fasta

Thought this file would be alright to add to Git since it is tiny. But I can also just have the user curl it themselves.

Yeah, seems fine

github-actions · 2026-03-06T20:33:11Z

Preview at https://biojulia.github.io/BioTutorials/28

danielle-pinto · 2026-03-06T20:38:16Z

cookbook/sequences.md

+ All of the nucleotides in all of the reads have a quality score of `$`, which corresponds to a probabilty of error of 0.50119.  
+ More information about how to convert ASCII values to quality scores [here](https://people.duke.edu/~ccc14/duke-hts-2018/bioinformatics/quality_scores.html).  
+ This would be quite poor if we were looking at Illumia data.  
+ However, because of how PacBio chemistry works,  


Want to confirm this information/explanation with you

I've never used PacBio data before!

Yeah, I think this is right, though maybe just grab an illumina dataset instead (or something from FormatSpecimens) so as not to need this particular bit - no need to over complicate things

danielle-pinto · 2026-03-06T20:43:16Z

cookbook/sequences.md

+The SRR (sample run accession number) is the unique identifier within SRA   
+and corresponds to the specific sequencing run. 
+
+In a later tutorial, we will discuss how to download this file in Julia using the SRR.


Is there any example code that can be shared on how to do this? Or I can show how this package can be used in a one line on the terminal here.

https://github.com/BioJulia/BioServices.jl is the cannonical way.

But also, another useful addition to the cookbook would be showing how to call shell commands from julia

kescobo · 2026-03-09T19:44:48Z

cookbook/assets/mecA.fasta

Yeah, seems fine

kescobo · 2026-03-09T19:49:09Z

cookbook/index.md

+
+This cookbook will provide a series of "recipes" that will help get started quickly with BioJulia so you can doing some bioinformatics!
+
+We have tutorials for reading in files, performing alignments, and using tools such as BLAST,    


We will have, no

Though another option would be to bring in FormatSpecimens.jl... maybe not for the very first one.

kescobo · 2026-03-09T19:50:55Z

cookbook/sequences.md

+The SRR (sample run accession number) is the unique identifier within SRA   
+and corresponds to the specific sequencing run. 
+
+In a later tutorial, we will discuss how to download this file in Julia using the SRR.


https://github.com/BioJulia/BioServices.jl is the cannonical way.

But also, another useful addition to the cookbook would be showing how to call shell commands from julia

kescobo · 2026-03-09T20:03:35Z

cookbook/sequences.md

+```
+curl -L --retry 5 --retry-delay 2 \
+  "https://trace.ncbi.nlm.nih.gov/Traces/sra-reads-be/fastq?acc=SRR12147540" \
+  | gzip -c > SRR12147540.fastq.gz


Re: command line - this can be

run(pipeline( `curl -L --retry 5 --retry-delay 2 "https://trace.ncbi.nlm.nih.gov/Traces/sra-reads-be/fastq?acc=SRR12147540"`, `gzip -c`, "SRR12147540.fastq.gz" ) )

or

run(pipeline( `curl -L --retry 5 --retry-delay 2 "https://trace.ncbi.nlm.nih.gov/Traces/sra-reads-be/fastq?acc=SRR12147540"`; stdout=pipeline(`gzip -c`; stdout="SRR12147540.fastq.gz") ) )

kescobo · 2026-03-09T20:07:58Z

cookbook/sequences.md

+ All of the nucleotides in all of the reads have a quality score of `$`, which corresponds to a probabilty of error of 0.50119.  
+ More information about how to convert ASCII values to quality scores [here](https://people.duke.edu/~ccc14/duke-hts-2018/bioinformatics/quality_scores.html).  
+ This would be quite poor if we were looking at Illumia data.  
+ However, because of how PacBio chemistry works,  


Yeah, I think this is right, though maybe just grab an illumina dataset instead (or something from FormatSpecimens) so as not to need this particular bit - no need to over complicate things

danielle-pinto added 3 commits March 4, 2026 22:04

add initial draft

faf2890

add sequence tutorial

38c2d71

fix typos and update mecA.fasta

bdd640c

danielle-pinto requested a review from kescobo March 6, 2026 20:30

danielle-pinto commented Mar 6, 2026

View reviewed changes

kescobo approved these changes Mar 9, 2026

View reviewed changes


		This cookbook will provide a series of "recipes" that will help get started quickly with BioJulia so you can doing some bioinformatics!

		We have tutorials for reading in files, performing alignments, and using tools such as BLAST,

Conversation

danielle-pinto commented Mar 6, 2026

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Mar 6, 2026

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants