SG: Development of Improved Methods of Biogeography and Ancestry Identification

Information

  • Award Id
    1456634
  • Award Effective Date
    6/1/2015
  • Award Expiration Date
    5/31/2017
  • Award Amount
    $ 149,994.00
  • Award Instrument
    Standard Grant

Identifying the geographic origin of an individual from genomic data is a major challenge because of the complexity of the problem and the potential for misinterpretation. Knowledge of this origin and of recent ancestry is essential for research in fields such as anthropology, sociology, forensics, personalized medicine and epidemiology, in which ancestry is an important variable. Inferring origin also requires understanding that all species, including humans, are mixed to some degree and that these mixture patterns can unlock the history and origin of their ancestors. As the proportion of mixed-ancestry individuals increases worldwide, there is a need to better infer their biogeography. Current methods are less than 50% accurate for European populations and highly inaccurate for non-Europeans. This project aims to address this shortcoming by developing novel, accurate and efficient tools to study individuals of mixed origin. These tools will have broad implications for practitioners trying to match cases and controls in disease studies, for geneticists studying biodiversity and the origins of humans, animals, and plants, and for the many people seeking answers about their past. This work will also contribute to the advancement of agricultural genomics by providing selection tools for plant and animal breeders.

The recently published first-generation Geographic Population Structure (GPS1) algorithm, developed by the PIs, provided biogeographical predictions that placed 83% of worldwide non-admixed individuals in their correct country of origin. This proposal builds on the success of the GPS1 algorithm to develop new tools for predicting biogeography in mixed individuals. The current aims are: (1) development of the next phase of GPS algorithms, which will be capable of predicting with high accuracy the countries of origin of an individual's parents, grandparents, or a more complex mixture; (2) development of a tool to infer local ancestry along the genome; (3) development of a GPS pipeline to infer the biogeographic origin of plants and animals. Modern computational approaches, such as genetic algorithms and simulated annealing, will be used to achieve optimal accuracy and computational efficiency. All algorithms will be implemented in the platform-independent languages R and Matlab and will use the mpiR R package and the Parallel Computing Toolbox, respectively, to enable parallel processing. This project is supported by the Evolutionary Processes and Biological Anthropology programs at NSF.

  • Program Officer
    Leslie Rissler
  • Min Amd Letter Date
    6/4/2015
  • Max Amd Letter Date
    6/4/2015
  • ARRA Amount

Institutions

  • Name
    Children's Hospital Los Angeles
  • City
    Los Angeles
  • State
    CA
  • Country
    United States
  • Address
    The Saban Research Institute
  • Postal Code
    900276062
  • Phone Number
    3233615828

Investigators

  • First Name
    Eran
  • Last Name
    Elhaik
  • Email Address
    e.elhaik@sheffield.ac.uk
  • Start Date
    6/4/2015 12:00:00 AM
  • First Name
    Tatiana
  • Last Name
    Tatarinova
  • Email Address
    ttatarinova@chla.usc.edu
  • Start Date
    6/4/2015 12:00:00 AM

Program Element

  • Text
    EVOLUTIONARY GENETICS
  • Code
    7378
  • Text
    Biological Anthropology
  • Code
    1392

Program Reference

  • Text
    BIODIVERSITY AND ECOSYSTEM DYNAMICS
  • Code
    9169
  • Text
    ENVIRONMENT AND GLOBAL CHANGE
  • Text
    Forensic Science
  • Code
    8819
  • Text
    Biological Anthropology
  • Code
    1392