Table Of ContentPredictive value of in silico analysis of variants at
intron-exon junctions: comparison of wet lab and
bioinformatics analysis.
M. Paola Lombardi
Clinical Molecular Geneticist
DNA-diagnostics Unit
Dept. Clinical Genetics, University of Amsterdam
Academic Medical Centre
Klinische Genetica
Introduction
Splice prediction tools are commonly used by diagnostics laboratories to predict the
effect of a genetic variant on splicing.
A large number of prediction tools are available, either as standalone programs or as
part of the Alamut splicing prediction module.
The performance of these algorithms have not been assessed and may give divergent
results.
At this point, no comprehensive interpretation guidelines are available: the user must
decide when a prediction is positive (i.e splice defect) or negative (i.e. no impact on
splicing).
Klinische Genetica
“ Practice Guidelines for the Evaluation of Pathogenicity and the Reporting of Sequence Variants in
Clinical Molecular Genetics (2013)”
In silico splice site predictions
These are generally valid when used correctly and within the scope of their applicability. An NGRL study (
http://www.ngrl.org.uk/manchester/sites/default/files/publications/Informatics/NGRL_ Splice_ Site_ Tools_
Analysis_2009.pdf) showed that the better performing tools were capable of a good degree of accuracy, and that
users can therefore be confident of the safe interpretation of results as part of the assessment of a variant. However
they must be used with caution and should not be relied upon alone.
Summary: it is acceptable to use in silico splice site prediction; however it is unacceptable to base an unequivocal
clinical interpretation solely on this line of evidence. It is, however, acceptable to suggest further investigations based
on the outcome. If this method of prediction is used it is recommended to arrive at an interpretation based upon a
consensus of at least 3 splice site prediction programmes.
It is not currently possible to set criteria for the change in prediction tools scores which should be considered
significant (e.g. 10% deviation from the wild-type score). This remains a matter for local judgement and agreement.
RNA studies
Where possible, RNA studies are the best means of interpreting the consequences of a splicing mutation.
Summary: Given the high predictive value of RNA studies they must be regarded as essential for the definitive
interpretation of putative splicing mutations. However it is recognised that not all laboratories have the facilities to
perform these analyses and that limited expression patterns may mean that the the required tissue is not available for
analysis.
Klinische Genetica
From literature:
Houdayer et al., 2011, In silico prediction of splice site affecting nucleotide variants. (In: In Silico Tools for Gene
Discovery, Methods in Molecular Biology 760)
1. MES-Alamut and SSF-like provide the best predictions in that a decisional threshold can be defined. It is
recommended to look at the score variations rather than the score themselves and to set a limit of significance for
score variations.
2. Variations occurring at the AG-GT consensus canonical site can be considered as impacting splicing
3. Variations occurring at loosely defined consensus positions are also reliable. Using MES-Alamut we recommend
that the mutant score should be at least 15% lower than the wild type score in order to consider the prediction as
positive (deleterious). Using SSF-like, the threshold should be 5%.
4. Variations occurring outside the consensus positions are less reliable with specificity issue mainly and have to be
interpreted in a context -dependent manner.
5. In silico tools cannot yet be used to define splicing outcome beacuse it is the result of a complex interplay between
consensus sequences and other factors from the splicing machinery.
6. To use in silico predictions for diagnostic purposes a decision threshold must be selected. At this point there is no
threshold allowing 100% sensitivity while keeping proper specificity.
7. The better the definition of the consensus (i.e. the higher the score) the better the reliability of the predictions. If
the wild-type consensus site is not scored or poorly defined the tool should not be used.
8. Using MES-Alamut a cryptic site is considered as putative competitor if it reaches at least 80% of the score of the
wild type one.
Klinische Genetica
RNA analysis
Methods
● Blood was collected in PAX tubes and total RNA was isolated with the
protocol provided by the manufacturer (PAX gene blood RNA kit-
PreAnalytiX).
● cDNA was synthesized with Superscript III using the standard protocol
(SuperscriptIII First-Strand Synthesis System for RT_PCR –Invitrogen).
● PCR amplification and sequencing primers were designed with Primer
3. PCR and sequence reactions were conducted with standard protocols
and using the same primer pairs.
Klinische Genetica
Variants at the acceptor splice site with a potential effect on splicing
Mutation In silico prediction RNA analysis
MYBPC3: c.3815-1G>A P: acceptor ss abolished P:intron 33 retention
TTN: c.61121-1G>A P: acceptor ss abolished P: activation acceptor ss
MYBPC3: c.655-3C>G P: acceptor ss almost abolished P: activation de novo acceptor ss
PKP2: c.2578-3A>G P: acceptor ss scores strongly reduced P: activation de novo acceptor ss
MSH2: c.367-11T>G P: acceptor ss scores strongly reduced P: activation de novo acceptor ss
KCNH2: c.2146-3C>G P: acceptor ss almost abolished P: exon skipping + activation acceptor ss
TTN: c.28031-1G>A P?: acceptor ss poorly defined, 2 programs P: activation acceptor ss
MYBPC3: c.1458-6G>A P?: acceptor ss abolished, 2 programs P: activation de novo acceptor ss
MYBPC3: c.2414-8T>A P?: acceptor ss not found P: activation de novo acceptor ss
SCN5A: c.393-5C>A P?: a-ss not found, new ss at 393-2 N-P: no effect on splicing
PRKAR1A: c.709-7_709-2del N-P: mildly reduced scores P?: skipping exon 7 (in a low % of mutant
transcript )
P: Pathogenic P? : Unknown Pathogenicity N-P: Non Pathogenic
Klinische Genetica
TTN: c.28031-1G>A
poorly defined ss
RNA analysis: activation splice site at 28036, 6 nt deletion
Klinische Genetica
MYBPC3: c.2414-8T>A
ss not found
RNA analysis: activation of de novo splice site, insertion AGCCAG
Klinische Genetica
SCN5A: c.393-5C>A
Wild type ss signal not
found
RNA analysis: no effect on splicing
Klinische Genetica
PRKAR1A: c.709-7_709-2del
PPRvKAR1ARKAR1A
-1.1%
+0,1%
-0.18%
-65.2%
-4.2%
RNA analysis: in a low % of transcript skipping of exon 7 (Groussin L et al.,
J of Clin Endo & Metab 2006 91 1943–1949)
Klinische Genetica
Description:part of the Alamut splicing prediction module. • The performance of these algorithms have not been assessed and may give divergent results. • At this