CFTR variations and classification

According to the international recommendations, we should use the term "sequence variant" to indicate any sequence change, pathogenic or not. However, in medical genetics, the term "mutation" is widely used to indicate a "disease-causing sequence change". The term "polymorphism" indicates either a "neutral variant" (non disease-causing change) or a "change found in the general population at a frequency of ≥1%". These terms are imperfect since some "mutation" can result in mild or incomplete phenotypes or have an incomplete penetrance (asymptomatic individuals), and some "polymorphisms" (i.e. common CFTR variants) may contribute to the final amount of functional protein and determine the variable expressivity of complex alleles or haplotypes.

The Human Genome Variation Society (HGVS) proposes to use the term "affects or does not affect function" and to classify variants in 5 categories:

Affects function
Probably affects function
Unknown: VUS or VUCS [variants of unknown (clinical) significance]
Probably does not affect function
Does not affect function
CFTR-France uses the nomenclature recommended by HGVS (Human Genome Variation Society, http://www.hgvs.org/), notably:
  • the 27 exons are numbered from 1 to 27 [instead of 1 to 24 in the historical numbering, with the three additional exons that were missed in the original publication of the gene designated as 6b, 14b, and 17b]
  • the "A" of the translation initiation codon (ATG) corresponds to the position "+1" [instead of +133 in the usual nucleotide numbering used in the legacy nomenclature of variants].

As an example, the disease-causing variation p.Phe508del corresponds to the nucleotide change c.1521_1523del and is located within exon 11 [instead of exon 10 in the historical exon numbering].


carte
(Modified from Claustres et al., 2005)

Reference sequences


CFTR geneOMIM #602421
Ensembl Gene ID ENSG00000001626
Chromosomal sequence NC_000007.14 (excepted for position g.117559479: A is the wild type nucleotide, G a variant nucleotide)
Genomic sequence NG_016465.3 (excepted for position g.98696: A is the wild type nucleotide, G a variant nucleotide)
Coding sequence NM_000492.3 (excepted for position c.1408: A is the wild type nucleotide, G a variant nucleotide)
Protein sequenceNP_000483 (excepted for position 470: Methionine is the wild type amino acid, Valine is a variant amino acid)



Download



  • Schematic representation of the 27 CFTR exons with usual and HGVS numbering of nucleotides bordering exons. Amino acids phasing and corresponding domains of the CFTR protein (NBDs, R domain, ...) are also illustrated (pdf file).
At least 5 different types of alterations can be distinguished, based on the classification suggested by Castellani et al., 2008:

  • Variants associated to classical forms of CF, with markedly reduced or absent CFTR protein. They correspond to the group A
  • Variants associated to CFTR-RDs (related disorders such as CBAVD); many variants in this group alter CFTR quantity and/or function but retain residual activity. They correspond to the group B.
  • Variants with clinical varying consequences (or "broad range of phenotypes"): they can be associated either to CF or CFTR-RDs (A/B).
Named
"disease-causing"
in CFTR-France
  • "Neutral" variants: they do not meet functional or clinical evidence to be disease-causing. They are classified in the group C.
"Non disease-causing"
in CFTR-France
  • "Indeterminate" variants : the effect on CFTR function is either not clearly proven or unknown. They are classified in the group D.
"Unclassified variant" (VUS)
in CFTR-France


Up to date, about 2,000 CFTR variations have been reported to the international CF database CFMD (Cystic Fibrosis Mutation Database). However, worldwide, less than 20 variants are found with a frequency ≥0.1% in the CF population (Castellani et al., 2008). The vast majority of CFTR disease-causing variations are rare or private, emphasizing the high allelic heterogeneity of the CFTR gene.

Frequent disease-causing variations

The most common disease-causing variation in populations from European descent is a deletion of 3 nucleotides, p.Phe508del (c.1521_1523del) (legacy name F508del). In France, this variant globally corresponds to 67% of alleles identified in patients with CF on average, with a wide regional variation: up to 50% of CF alleles in the most southern regions and almost 80% in Brittany (Claustres et al., 2000, Audrézet et al., 2014).
The loss of only one amino acid (Phenylalanine at position 508) causes a severe misfolding of the CFTR protein. The mutant chloride channel shows an impaired glycosylation with a dramatic decrease in exocytic trafficking and peripheral stability, and is finally subjected to rapid proteasome-mediated degradation (p.Phe508del CFTR processing).
In France, 3 other disease-causing variations have a frequency over 1%: p.Gly542* (G542X), c.1585-1G>A (1717-1G>A) and p.Asn1303Lys (N1303K).

The frequency and type of mutants vary widely according to geographic and ethnic origin of individuals.

carte
Regional distribution of the most common disease-causing variations in France
(The French network of CFTR laboratories. Extracted from Des Georges et al., 1998 and Claustres et al., 2000).
Mutations are reported with their legacy names.


Rare variants

Only a few hundred of the 2,000 CFTR variants have been experimentally investigated and the disease liability of 159 most common variants found in the CF populations worldwide has been recently assessed for clinical severity, functional consequences and disease penetrance (Sosnay et al., 2013).
This gap in the knowledge of clinical relevance of the vast majority of rare CFTR variants raises major issues in diagnosis and genetic counseling.
Classifications of CFTR variations based on experimental functional consequences or on computational methods that predict the impact of variations both help to interpret variants and to characterize their disease liability.

A/ Classification based on the functional impact of variations on transcription efficiency, RNA splicing, protein processing or function.

Common and well-characterized variations that have been experimentally tested

Classes I, II, III, IV, V and VI



classes
details
B/ Classification based on computational predictions

Rare and poorly documented variations that have not been tested

VUS classes 1, 2, 3, 4 and 5

(VUS = Unclassified Variant)



carte

details

Remarks on the classification based on the functional impact of variations


1/ This classification proposed more than 20 years ago by Welsh & Smith (1993) was based on functional criteria related to chloride transport. It does not take into consideration the numerous other regulative roles of CFTR.

2/ Only a very limited number of mutations have been assessed for CFTR dysfunction in experimental assays and epidemiological studies, so some variants may be misclassified.

3/ Many CFTR variants have been shown to impair more than a single process; an example is the most common mutation, p.Phe508del, which also creates a gating defect (class III) amongst the few misfolded proteins that escape the proteasome degradation pathway and reach the apical membrane (Dalemans et al., 1991).

4/ Clinicians and pharmaceutical industry should consider updated information on the liability and classification of CFTR variants.

A/ Classification based on the effect on CFTR gene or protein function


carte

Class I: Defective protein synthesis, leading to absence of functional CFTR channel at the surface of epithelial cells. This class includes nonsense variants and variants that result in a premature termination codon (PTC) such as variations affecting canonical splice sites and insertions or deletions that cause a shift in the reading frame. PTC variants generally cause the degradation of mRNA by the control mechanism NMD (nonsense-mediated mRNA decay), leading to the absence of protein production; e.g: p.Arg553* (R553X), p.Gly542* (G542X), p.Trp1282* (W1282X). There are few exceptions where transcripts bearing a PTC are stable but the mutant protein is rapidly degraded; e.g: p.Arg1162* (R1162X).

Class II: Defective processing and maturation, leading to absence of functional CFTR channel at the surface of epithelial cells. A number of mutations cause a defect in post-translational folding, maturation (incomplete glycosylation) and localization of CFTR, leading to its rapid degradation. Although there are ethnic and geographical variations, approximately 90% of European CF patients harbor a class II variant on at least one mutant allele, p.Phe508del accounting for the vast majority of them. Other class II variants are missenses such as p.Asn1303Lys (N1303K) or p.Arg560Thr (R560T).

Class III: Defective channel regulation/gating, leading to a present but non-functional CFTR protein at the apical membrane. Some missense variations, mostly located in Nucleotide Binding Fold (NBD) domains, are properly processed and addressed to the membrane, but the amino acid substitution leads to markedly reduced anion conductance and resistance to activation (preventing channel opening). The most widely studied example is the "gating mutant" p.Gly551Asp (G551D).

Class IV: Defective channel conductance, leading to a present CFTR protein at the apical membrane but with reduced anion conductance. Some missense variations, mostly located in trans-membrane domains involved in the formation of the channel pore, produce a CFTR protein with altered conductance properties such as ion selectivity. Examples include p.Arg347Pro (R347P), p.Arg117His (R117H).

Class V: Mutations causing abnormal splicing, leading to a mix of CFTR mRNA correctly or aberrantly spliced, and thus to reduced amounts of normal CFTR protein. This class includes intronic variations such as "the 5T allele" (c.1210-12T[5]) or c.3718-2477C>T (3849+10kbC>T) and some missense variations altering protein trafficking (p.Ala455Glu) or located within the promoter.

Class VI: Defective stability of CFTR protein, leading to absence or severe reduction of CFTR at the membrane surface. Some missense variants such as p.Asn287Tyr (N287Y) result in CF without affecting biosynthesis or channel gating: they accelerate endocytic retrieval from the plasma membrane and lead to a severe reduction in the steady-state level of CFTR at the cell surface (Silvis et al., 2003).

Generally, classes I, II, III variants are considered as "severe" whereas classes IV, V and VI, which retain some residual CFTR function, are considered as "milder" variations. However, CFTR dysfunction can vary depending on organs, and may change over time.

B/ Classification based on in silico predictions


Because functional studies require considerable time and financial resources, only a few CFTR variants have been studied so far. This is the reason why so many researchers have developed bioinformatic tools that use protein structure information and common ancestral sequences to make "in silico" predictions of the effect of a variation, especially in cases of rare variants with very restricted information on associated clinical and epidemiological characteristics.

Bioinformatics tools such as PolyPhen and SIFT have been widely used to evaluate amino acid substitutions in CFTR, but can make significant errors or be totally discordant. As reminded by The Association for Clinical Genetic Science and the Dutch Society of Clinical Genetic Laboratory Specialists (2013), "no one in silico tool can be considered superior nor achieve complete accuracy". To perform in silico analysis, they recommend the use of at least three different programs based on different algorithms to reduce the prevalence of either a false negative or false positive result. Other researchers have developed algorithms that predict the likelihood that a variant causes mis-splicing. Coupling bioinformatics resources with functional splicing assays (hybrid minigenes) can help to predict the pathogenicity of CFTR variants of unknown significance (VUS) (Desmet et al., 2010).
Combining clinical, epidemiological (parents and controls), predictive and functional data provide the most efficient system to correctly evaluate the impact of CFTR variants (Sosnay et al., 2011; Raynal et al., 2013, Sosnay and Cutting, 2014).

The Association for Clinical Genetic Science and the Dutch Society of Clinical Genetic Laboratory Specialists (2013) recommended a 5-class system for reporting VUS:

carte




Go to CFTRare