SEP 06, 2018 8:05 PM PDT

The Human Genome May Contain 20% Fewer Genes Than Thought

WRITTEN BY: Carmen Leitch

Around the year 2000, the human genome was estimated to contain anywhere from 50,000 to 90,000 genes, and that number has been steadily revised downward. After the human genome was sequenced, it appeared that there were around 20,500 genes. Now, a research team led by scientists at the Spanish National Cancer Research Centre (CNIO) has found that there are probably even fewer genes that code for protein; they estimate that 20 percent of the genes classified as coding may actually be non-coding. This work, reported in Nucleic Acids Research, may have serious implications for biomedical research.

Image credit: Pixabay

It has been challenging to determine the exact number of coding genes; the human genome is complex and contains thousands of genes. The team started by carefully comparing the proteome, the set of proteins expressed in cells, as annotated in the GENCODE/Ensembl, RefSeq and UniProtKB reference databases. Of the 22,210 coding genes listed, only 19,446 were found in all three databases.

The team focused on the 2,764 genes that were found in only one or two of the references; after looking at annotations and experimental evidence, they found that nearly all of them were predicted to be pseudogenes (which have unknown functions but seem to be non-coding) or other genes that don't encode protein.

The team also identified 1,470 coding genes in the databases that don't evolve like other genes and probably aren't protein-coding. Adding these to the 2,764 genes missing from at least one database, the researchers concluded that 4,234 genes in all may be non-coding.
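The arithmetic behind these figures can be checked with a short sketch. It uses only the counts quoted in this article and assumes, as the article's total implies, that the two groups of genes do not overlap; the variable names are illustrative and are not taken from the study.

```python
# Counts quoted in the article (variable names are illustrative).
listed_coding_genes = 22_210  # coding genes listed across the three databases
in_all_three = 19_446         # genes found in all three databases
unusual_evolution = 1_470     # genes that do not evolve like other coding genes

# Genes found in only one or two of the reference databases.
in_one_or_two = listed_coding_genes - in_all_three

# Total flagged as probably non-coding, assuming the two groups do not overlap.
flagged = in_one_or_two + unusual_evolution

# Share of all listed coding genes that were flagged, as a percentage.
flagged_share = 100 * flagged / listed_coding_genes

print(in_one_or_two)
print(flagged)
print(round(flagged_share, 1))
```

Under that assumption, the flagged share comes out at a little over 19 percent, which matches the roughly 20 percent cited in the article.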

"We have been able to analyze many of these genes in detail," explained Michael Tress of the CNIO Bioinformatics Unit, "and more than 300 genes have already been reclassified as non-coding." The results are already being included in the new annotations of the human genome by the GENCODE international consortium, of which the CNIO researchers are part.

More work remains before we know everything about the human genome. "Our evidence suggests that humans may only have 19,000 coding genes, but we still do not know which 19,000 genes are," noted first author Federico Abascal of the Wellcome Trust Sanger Institute in the United Kingdom.

The findings may send serious ripples through some areas of research. "Surprisingly, some of these unusual genes have been well studied and have more than 100 scientific publications based on the assumption that the gene produces a protein," added David Juan of the Pompeu Fabra University.

Sources: AAAS/Eurekalert! via CNIO, Nature Genetics, Nucleic Acids Research
