BioRuby

Bioinformatics

% sudo gem install bio 
% su # ruby setup.rb 
> gem install bio 
#!/usr/bin/env ruby require 'bio' # create a DNA sequence object from a String dna = Bio::Sequence::NA.new("atcggtcggctta") # create an RNA sequence object from a String rna = Bio::Sequence::NA.new("auugccuacauaggc") # create a Protein sequence from a String aa = Bio::Sequence::AA.new("AGFAVENDSA") # you can check if the sequence contains illegal characters # that is not an accepted IUB character for that symbol # (should prepare a Bio::Sequence::AA#illegal_symbols method also) puts dna.illegal_bases # translate and concatenate a DNA sequence to Protein sequence newseq = aa + dna.translate puts newseq # => "AGFAVENDSAIGRL" 
#!/usr/bin/env ruby # you can use Bio::Sequence object as a String object to print, seamlessly dna = Bio::Sequence::NA.new("atgc") puts dna # => "atgc" str = dna.to_s puts str # => "atgc" 
#!/usr/bin/env ruby require 'bio' # create a DNA sequence seq = Bio::Sequence::NA.new("atggccattgaatga") # translate to protein prot = seq.translate # prove that it worked puts seq # => "atggccattgaatga" puts prot # => "MAIE*" 
#!/usr/bin/env ruby require 'bio' # make a 'codon' codon = Bio::Sequence::NA.new("uug") # you can translate the codon as described in the previous section. puts codon.translate # => "L" 
#!/usr/bin/env ruby require 'bio' # make a 'codon' codon = Bio::Sequence::NA.new("uug") # select the standard codon table codon_table = Bio::CodonTable[1] # You need to convert RNA codon to DNA alphabets because the # CodonTable in BioRuby is implemented as a static Hash with keys # expressed in DNA alphabets (not RNA alphabets). codon2 = codon.dna # get the representation of that codon and translate to amino acid. amino_acid = codon_table[codon2] puts amino_acid # => "L" 
#!/usr/bin/env ruby require 'bio' # Generates a sample 100bp sequence. seq1 = Bio::Sequence::NA.new("aatgacccgt" * 10) # Naming this sequence as "testseq" and print in FASTA format # (folded by 60 chars per line). puts seq1.to_fasta("testseq", 60) 
#!/usr/bin/env ruby require 'bio' file = Bio::FastaFormat.open(ARGV.shift) file.each do |entry| # do something on each fasta sequence entry end 
#!/usr/bin/env ruby require 'bio' Bio::FlatFile.auto(ARGF) do |ff| ff.each do |entry| # do something on each fasta sequence entry end end 
#!/usr/bin/env ruby require 'bio' Bio::FlatFile.open(Bio::FastaFormat, ARGV[0]) do |ff| ff.each do |entry| # do something on each fasta sequence entry end end


A BioRuby shell on Rails
Stable release	1.5.2 / 19 November 2018 (2018-11-19)
Repository	github.com/bioruby/bioruby
Written in	Ruby
Type	Bioinformatics
License	GPL
Website	bioruby.open-bio.org

Class names	Description
Bio::Sequence::NA, Bio::Sequence::AA	Nucleic and amino acid sequences
Bio::Locations, Bio::Features	Locations / Annotations
Bio::Reference, Bio::PubMed	Literatures
Bio::Pathway, Bio::Relation	Graphs
Bio::Alignment	Alignments

Class names	Description
Bio::GenBank, Bio::EMBL	GenBank / EMBL
Bio::SPTR, Bio::NBRF, Bio::PDB	SwissProt and TrEMBL / PIR / PDB
Bio::FANTOM	FANTOM DB (Functional annotation of mouse)
Bio::KEGG	KEGG database parsers
Bio::GO, Bio::GFF	Bio::PROSITE FASTA format / PROSITE motifs
Bio::FastaFormat, Bio::PROSITE	FASTA format / PROSITE motifs

Class names	Description
Bio::Blast, Bio::Fasta, Bio::HMMER	Sequence similarity (BLAST / FASTA / HMMER)
Bio::ClustalW, Bio::MAFFT	Multiple sequence alignment (ClustalW / MAFFT)
Bio::PSORT, Bio::TargetP	Protein subcellular localization (PSORT / TargetP)
Bio::SOSUI, Bio::TMHMM	Transmembrane helix prediction (SOSUI / TMHMM)
Bio::GenScan	Gene finding (GenScan)

Class names	Description
Bio::Registry	OBDA Registry service
Bio::SQL	OBDA BioSQL RDB schema
Bio::Fetch	OBDA BioFetch via HTTP
Bio::FlatFileIndex	OBDA flat file indexing system
OBDA flat file indexing system	Flat file reader with data format autodetection
Bio::DAS	Distributed Annotation System (DAS)
Bio::KEGG::API	SOAP/WSDL intarface for KEGG

BioRuby

History

BioRuby

Version history^[8]

Installation

Installation of BioRuby

macOS/Unix/Linux

Windows

Usage

Basic Syntax^[10]

Basic Sequence Manipulation

String to Bio::Sequence object

Bio::Sequence object to String

Translation

Translating a DNA or RNA sequence or SymbolList to orotein

Translating a single codon to a single amino acid

Sequence I/O

Writing sequences in Fasta format

Reading in a Fasta file

Classes and modules

Major classes

Basic data structure

Databases and sequence file formats

Wrapper and parsers for bioinformatics tool

File, network and database I/O

Biogem

Popular Biogems

Plugins

See also^[14]

BioRuby

Ruby/bioinformatics links

Sister projects

Blogs

References

External links

#	Biogem	Description	Version
1	bio	Bioinformatics Library	1.4.3.0001
2	biodiversity	Parser of scientific names	3.1.5
3	Simple Spreadsheet extractor	Basic spreadsheet content extraction using Apache poi	0.13.3
4	Bio gem	Software generator for Ruby	1.36
5	Bio samtools	Binder of samtools for Ruby	2.1.0
6	t2 server	Support for interacting with the taverna 2 server	1.1.0
7	bio ucsc api	The Ruby ucsc api	0.6.2
8	entrez	http request to entrez e-utilities	0.5.8.1
9	bio gadget	Gadget for bioinformatics	0.4.8
10	sequenceserver	Blast search made easy!	0.8.7

BioRuby

History

BioRuby

Version history[8]

Installation

Installation of BioRuby

macOS/Unix/Linux

Windows

Usage

Basic Syntax[10]

Basic Sequence Manipulation

String to Bio::Sequence object

Bio::Sequence object to String

Translation

Translating a DNA or RNA sequence or SymbolList to orotein

Translating a single codon to a single amino acid

Sequence I/O

Writing sequences in Fasta format

Reading in a Fasta file

Classes and modules

Major classes

Basic data structure

Databases and sequence file formats

Wrapper and parsers for bioinformatics tool

File, network and database I/O

Biogem

Popular Biogems

Plugins

See also[14]

BioRuby

Ruby/bioinformatics links

Sister projects

Blogs

References

External links

Version history^[8]

Basic Syntax^[10]

See also^[14]