The human genome is a complete set of nucleic acid sequences for humans, encoded as DNA within the 23 chromosome pairs in cell nuclei and in a small DNA molecule found within individual mitochondria. These are usually treated separately as the nuclear genome and the mitochondrial genome. Human genomes include both protein-coding DNA genes and noncoding DNA. Haploid human genomes, which are contained in germ cells (the egg and sperm gamete cells created in the meiosis phase of sexual reproduction before fertilization creates a zygote) consist of three billion DNA base pairs, while diploid genomes (found in somatic cells) have twice the DNA content. While there are significant differences among the genomes of human individuals (on the order of 0.1% due to single-nucleotide variants and 0.6% when considering indels), these are considerably smaller than the differences between humans and their closest living relatives, the bonobos and chimpanzees (~1.1% fixed single-nucleotide variants and 4% when including indels).Although the sequence of the human genome has been (almost) completely determined by DNA sequencing, it is not yet fully understood. Most (though probably not all) genes have been identified by a combination of high throughput experimental and bioinformatics approaches, yet much work still needs to be done to further elucidate the biological functions of their protein and RNA products. Recent results suggest that most of the vast quantities of noncoding DNA within the genome have associated biochemical activities, including regulation of gene expression, organization of chromosome architecture, and signals controlling epigenetic inheritance.
Prior to the acquisition of the full genome sequence, estimates of the number of human genes ranged from 50,000 to 140,000 (with occasional vagueness about whether these estimates included non-protein coding genes). As genome sequence quality and the methods for identifying protein-coding genes improved, the count of recognized protein-coding genes dropped to 19,000-20,000. However, a fuller understanding of the role played by sequences that do not encode proteins, but instead express regulatory RNA, has raised the total number of genes to at least 46,831, plus another 2300 micro-RNA genes. By 2012, functional DNA elements that encode neither RNA nor proteins have been noted. A 2018 population survey found another 300 million bases of human genome that was not in the reference sequence.Protein-coding sequences account for only a very small fraction of the genome (approximately 1.5%), and the rest is associated with non-coding RNA genes, regulatory DNA sequences, LINEs, SINEs, introns, and sequences for which as yet no function has been determined.
2001Feb, 15
The first draft of the complete human genome is published in Nature.
Choose Another Date
Events on 2001
- 6May
Pope John Paul II
During a trip to Syria, Pope John Paul II becomes the first pope to enter a mosque. - 11Sep
September 11 attacks
Two hijacked aircraft crash into the World Trade Center in New York City, while a third smashes into The Pentagon in Arlington County, Virginia, and a fourth into a field near Shanksville, Pennsylvania, in a series of coordinated suicide attacks by 19 members of al-Qaeda. A total of 2,996 people are killed. - 14Sep
September 11, 2001 attacks
Historic National Prayer Service held at Washington National Cathedral for victims of the September 11 attacks. A similar service is held in Canada on Parliament Hill, the largest vigil ever held in the nation's capital. - 17Sep
September 11 attacks
The New York Stock Exchange reopens for trading after the September 11 attacks, the longest closure since the Great Depression. - 7Oct
War in Afghanistan (2001-2014)
The Global War on Terrorism begins as a result of the September 11 attacks. The U.S. invasion of Afghanistan initiates with an air assault and covert operations on the ground.