Home > Genomic services > Bioinformatics

Bioinformatics

Special offer: Quick 454 and HiSeq 2000 genome assembly, from 150€/genome in 48h*

  • Assembly of bacterial genomes
  • Assembly of any 454 and HiSeq data
  • Assembly of all kind of genomes

From .fna and .qual data for 454, and .fastQ and .fastA data for Hiseq.* Unit price excluding VAT. Offer valid for a maximum of 10 bacterial genomes sequenced on 454.

Our offer:

Alignment
Comparison
Interpretation: length of overlapping regions, reference coverage in homology percentage, number of contigs, number of scaffolds (paired-end)
Sending of interpretation reporting and alignment results on ftp server

> Contact us

 

Genomes, metagenomes and transcriptomes

To meet growing and ever changing needs, we developed full-service bioinformatics to provide sequences analysis pipeline from conventional technologies or high throughput technologies.

Aware of the importance for researchers to perform their analysis by themselves, we set up training in bioinformatics for biologists and physicians who wish to understand the methods of analysis, and to acquire proficient skills to analyze their own biological data: Genoscreen is a registered training organization (DRTEFP Lille, France), all our courses are eligible for professional training.

Associated with such training provided in user's lab, we offer recently third aspect of bioinformatics services: tailored technological and development tools: Mini, Midi and Maxi code, and ready-to-use packages: FASTA sequences handling, sequence selections in a file and BLAST results analysis.

This e-mail address is being protected from spambots. You need JavaScript enabled to view it

Catalogue - English version

Catalogue - French version

Bioinformatics is constituted by a set of concepts and techniques needed for computer interpretation of biological information. Several fields of application or sub-disciplines of bioinformatics exist, inclunding our specialty:

Sequences bioinformatics, which deals with the analysis of data from the genetic information contained in the sequence of DNA or protein that it encodes. This branch is particularly involved in identifying the similarities between the sequences, the identification of biologically relevant genes or regions in DNA or proteins, based on the sequence of their elementary components (nucleotides, amino acids).

We also developed an expertise in statistical and populations bioinformatics.

Focus : Sequences analysis

Since the invention of DNA sequencing by Frederick Sanger in the second half of the 70's, technological advances in this field were such that the volume of DNA sequences available has increased exponentially with a doubling time around 15 to 18 months, ie a bit faster than the processing power of computers (Moore's Law).

An exponentially growing number of genome sequences or complementary DNAs are available, for which the annotation (or interpretation of their biological function) remains to be done.
The first challenge is to organize that huge mass of information and make it available to the general research community. This was made possible through various databases, accessible on line.

Tools for sequences analyzing should be developed in order to determine their properties..

  • Research of proteins from the translation of known nucleic sequences. This requires the determination of open reading frames of a nucleic acid sequence and its possible translation.
  • Research of sequences in a database from another sequence or sequence fragment. Most frequent used software belong the the BLAST "family" (blast, blastp, blastx, tblastx and their derivatives).
  • Sequence alignment: to find the similarities between two sequences and determine their possible homologies. Alignments are the basis for the construction of kinship according to molecular criteria, as well as the recognition of patterns in a particular protein from its sequence.
  • Research of patterns or consensus structures to characterize the sequences.

Bioinformatics is also involved in sequencing, like for the use of DNA chips (biochips). The principle is based on the characteristic of spontaneously re-forming of the double helix of deoxyribonucleic acid facing the complementary strand. The four DNA basis molecules have the particuliarity to unite in pairs. If a patient has a disease, the strands of DNA extracted from a patient will hybridize with the strands of synthetic DNA representing the analyzed conditions.

Nowadays, bioinformatics is mainly working on next generation sequencing sequence data analysis, ie new technologies created to generate not tens and hundreds of nucleotides at once, but hundreds of thousands, or millions or billions. Next generation analysis need the same methodologies and reflections that older technologies, however these must be adapted to a mass amount of data. Thus the algorithms must be optimized, and computer equipment used must be strong enough.