Googles DeepMind aces protein folding

first_imgComplex of bacteria-infecting viral proteins modeled in CASP 13. The complex contains four separate subunits that were modeled individually. By Robert F. ServiceDec. 6, 2018 , 12:05 PM DeepMind’s AlphaFold Sign up for our daily newsletter Get more great content like this delivered right to you! Country Google’s DeepMind aces protein folding Data: abcdefg hijkl mnop qrstu vwxyz 1234 56789 Country * Afghanistan Aland Islands Albania Algeria Andorra Angola Anguilla Antarctica Antigua and Barbuda Argentina Armenia Aruba Australia Austria Azerbaijan Bahamas Bahrain Bangladesh Barbados Belarus Belgium Belize Benin Bermuda Bhutan Bolivia, Plurinational State of Bonaire, Sint Eustatius and Saba Bosnia and Herzegovina Botswana Bouvet Island Brazil British Indian Ocean Territory Brunei Darussalam Bulgaria Burkina Faso Burundi Cambodia Cameroon Canada Cape Verde Cayman Islands Central African Republic Chad Chile China Christmas Island Cocos (Keeling) Islands Colombia Comoros Congo Congo, the Democratic Republic of the Cook Islands Costa Rica Cote d’Ivoire Croatia Cuba Curaçao Cyprus Czech Republic Denmark Djibouti Dominica Dominican Republic Ecuador Egypt El Salvador Equatorial Guinea Eritrea Estonia Ethiopia Falkland Islands (Malvinas) Faroe Islands Fiji Finland France French Guiana French Polynesia French Southern Territories Gabon Gambia Georgia Germany Ghana Gibraltar Greece Greenland Grenada Guadeloupe Guatemala Guernsey Guinea Guinea-Bissau Guyana Haiti Heard Island and McDonald Islands Holy See (Vatican City State) Honduras Hungary Iceland India Indonesia Iran, Islamic Republic of Iraq Ireland Isle of Man Israel Italy Jamaica Japan Jersey Jordan Kazakhstan Kenya Kiribati Korea, Democratic People’s Republic of Korea, Republic of Kuwait Kyrgyzstan Lao People’s Democratic Republic Latvia Lebanon Lesotho Liberia Libyan Arab Jamahiriya Liechtenstein Lithuania Luxembourg Macao Macedonia, the former Yugoslav Republic of Madagascar Malawi Malaysia Maldives Mali Malta Martinique Mauritania Mauritius Mayotte Mexico Moldova, Republic of Monaco Mongolia Montenegro Montserrat Morocco Mozambique Myanmar Namibia Nauru Nepal Netherlands New Caledonia New Zealand Nicaragua Niger Nigeria Niue Norfolk Island Norway Oman Pakistan Palestine Panama Papua New Guinea Paraguay Peru Philippines Pitcairn Poland Portugal Qatar Reunion Romania Russian Federation Rwanda Saint Barthélemy Saint Helena, Ascension and Tristan da Cunha Saint Kitts and Nevis Saint Lucia Saint Martin (French part) Saint Pierre and Miquelon Saint Vincent and the Grenadines Samoa San Marino Sao Tome and Principe Saudi Arabia Senegal Serbia Seychelles Sierra Leone Singapore Sint Maarten (Dutch part) Slovakia Slovenia Solomon Islands Somalia South Africa South Georgia and the South Sandwich Islands South Sudan Spain Sri Lanka Sudan Suriname Svalbard and Jan Mayen Swaziland Sweden Switzerland Syrian Arab Republic Taiwan Tajikistan Tanzania, United Republic of Thailand Timor-Leste Togo Tokelau Tonga Trinidad and Tobago Tunisia Turkey Turkmenistan Turks and Caicos Islands Tuvalu Uganda Ukraine United Arab Emirates United Kingdom United States Uruguay Uzbekistan Vanuatu Venezuela, Bolivarian Republic of Vietnam Virgin Islands, British Wallis and Futuna Western Sahara Yemen Zambia Zimbabwe Ready, set, fold! Points above the red line show protein-folding predictions where AlphaFold won. It lost those below the line. Those on the line were essentially a tie. Email 25 25 Andriy Kryshtafovych/University of California, Davis 100 50 Click to view the privacy policy. Required fields are indicated by an asterisk (*) 0 Other top competitors Data: abcdefg hijkl mnop qrstu vwxyz 1234 56789 100 75 Turns out mastering chess and Go was just for starters. On 2 December, the Google-owned artificial intelligence firm DeepMind took top honors in the 13th Critical Assessment of Structure Prediction (CASP), a biannual competition aimed at predicting the 3D structure of proteins.The contest worked like this: Competing teams were given the linear sequence of amino acids for 90 proteins for which the 3D shape is known but not yet published. Teams then computed how those sequences would fold. Though London-based DeepMind had not previously joined this competition, the predictions of its AlphaFold software were, on average, more accurate than those of its 97 competitors.How close was the race? By one metric, not very. For protein sequences for which no other information was known—43 of the 90—AlphaFold made the most accurate prediction 25 times. That far outpaced the second place finisher, which won three of the 43 tests. 50 Protein Data Bank So AlphaFold lapped the competition? Well, not exactly. When you track how much AlphaFold won or lost by in each case, the results look much closer. That’s shown in the graph below. It shows AlphaFold’s performance on the vertical axis and that from the best other group on the horizontal axis. Points above the red line show predictions where AlphaFold won. Points below, it lost. And those on the red line were essentially a tie. The upshot? AlphaFold won a lot of rounds, with an average margin of 15% accuracy improvement over other groups on the toughest 43 tests, says John Moult, CASP’s lead organizer and a computational biologist at the University of Maryland in Rockville. ​/Science 50 75 0 75 ​Data: Andriy Kryshtafovych, U.C. Davis 25 0 So, what was going on? David Baker, a CASP organizer, participant, and computational modeling expert at the University of Washington in Seattle, notes that DeepMind’s scientists built on two algorithm strategies pioneered by others. First, by comparing vast troves of genomic data on other proteins, AlphaFold was able to better decipher which pairs of amino acids were most likely to wind up close to one another in folded proteins. Second, related comparisons also helped them gauge the most probable distance between neighboring pairs of amino acids and the angles at which they bound to their neighbors. Both approaches do better with the more data they evaluate, which makes them more apt to benefit from machine learning computer algorithms, such as AlphaFold, that solve problems by crunching large data sets. DeepMind scientists “are extremely good at machine learning and have a superb team” with deeper pockets than most academic groups, Baker says.Still, not bad for a newbie. “Give them credit,” adds John Moult, another CASP organizer and a computational biologist at the University of Maryland in Rockville. “They came from nowhere.”last_img read more