0条Plus

谷歌推出史上最强人脸识别系统

Derrick Harris 2015年03月24日

谷歌研究人员日前声称，该公司正在开发的FaceNet人工智能系统是迄今为止最精确的人脸识别技术。这套系统不仅能够在庞大的数据库中迅速识别两张脸是否一样，还能将人名和脸匹配，甚至能将那些看起来最像或最不像的脸归集在一起。FaceNet的横空出世意味着“深度学习”这一人工智能技术又向前迈进了一大步。

姿势和光线一直都是人脸识别的大难题。该图显示了两张脸在不同组合中所体现出来的“差距”（差距为0.0意味着两张脸完全一样）。供图：谷歌公司

有些人总喜欢夸口说：“我从来不会忘记别人的长相。”在人工智能研究突飞猛进的今天，还要这么夸口就有点奇怪了。事实上，现在有些电脑能记住2.6亿张脸。

上周，谷歌公司的三位研究人员发表了一篇有关全新人工智能系统的研究论文。这一系统名为FaceNet，谷歌号称它是迄今为止最精确的人脸识别技术。面对一个名为“人面数据库”（Labeled Faces in the Wild）的常用人脸识别数据库时，FaceNet识别的准确率近乎百分之百。这个数据库容纳了网上搜集的一万三千多张人脸照片。而在面对一个含有2.6亿张人脸照片的庞大数据库时，这个系统的准确率也超过86%。

研究人员声称，面对“人面数据库”时，他们主要测试该系统的“确认能力”。就本质而言，他们衡量的是这套算法在判断两张照片是否同属一人时到底有多准确。

去年12月，一个中国研究团队也声称，对这套数据库的识别准确率超过99%。去年，Facebook公司的研究人员发表论文称，他们也能做到超过97%的准确率。根据这篇论文援引的一些研究人员的说法，人类对该数据库的识别准确率仅有97.5%。

不过，谷歌研究人员采用的方法绝不只是确认两张脸是否一样这么简单。这套系统还能将人名和脸匹配——经典的人脸识别技术，甚至能把看起来最像或最不像的脸归集在一起。

目前这还仅仅是研究而已，但它预示着，在不远的将来，我们经常在网上视频或大片里看到的那种能惩治犯罪、加强监控的电脑将更加触手可及。比起在交友应用Tinder上划来划去，它可能会使网上交友更加简单（也更停留于表面）。

很喜欢1998年左右时的布拉德•皮特？这个数据库里有500张看起来很像他的脸。

一开始，我们会看到谷歌的FaceNet及Facebook的DeepFace系统在各自的网络平台上运行。它们会让用户更加方便地（或者说更加自动化地）给照片贴上标签，找到要找的人，因为这些算法知道照片中的这个人是谁，即使这些照片并没有姓名标记。此外，这类系统还能让网络公司更加方便地基于照片人物的身份，来分析它们的用户社交网络，评判全球流行趋势及名人的受欢迎程度。

尽管谷歌和Facebook在人脸识别技术上最近才取得这类进步，但与之类似的电脑系统早就无处不在。它们都含有一种名为“深度学习”的人工智能技术。事实证明，这种技术能够极其有效地完成识别物体（按照某些标准来看，机器在这方面已经比人类要强了）、识别语音及理解书面文字等机器辨别任务。

除了谷歌和Facebook外，微软、百度和雅虎也在“深度学习”研究上投入重金。这种算法已经应用在一些我们常用的功能上了，比如智能手机语音控制、Skype实时翻译、短信预测输入法及先进的图像搜索等（如果你已经将一些图片上传至Google+账户里，你就可以试试用它们来搜索特定目标）。Spotify和Netflix公司正在研究如何利用深度学习技术更智能地推荐视频。贝宝公司则将其用于打击欺诈。

“I never forget a face,” some people like to boast. It’s a claim that looks quainter by the day as artificial intelligence research continues to advance. Some computers, it turns out, never forget 260 million faces.

Last week, a trio of Google GOOG -0.66% researchers published a paper on a new artificial intelligence system dubbed FaceNet that it claims represents the most-accurate approach yet to recognizing human faces. FaceNet achieved nearly 100-percent accuracy on a popular facial-recognition dataset called Labeled Faces in the Wild, which includes more than 13,000 pictures of faces from across the web. Trained on a massive 260-million-image dataset, FaceNet performed with better than 86 percent accuracy.

Researchers benchmarking their facial-recognition systems against Labeled Faces in the Wild are testing for what they call “verification.” Essentially, they’re measuring how good the algorithms are at determining whether two images are of the same person.

In December, a team of Chinese researchers also claimed better than 99 percent accuracy on the dataset. Last year, Facebook researchers published a paper boasting better than 97 percent accuracy. The Facebook FB 1.66% paper points to researchers claiming that humans analyzing images in the Labeled Faces dataset only achieve 97.5 percent accuracy.

However, the approach Google’s researchers took goes beyond simply verifying whether two faces are the same. Its system can also put a name to a face—classic facial recognition—and even present collections of faces that look the most similar or the most distinct.

This is all just research, but it points to a near future where the types of crime-fighting, or surveillance-enhancing, computers we often see on network television and blockbuster movies will be much more attainable. Or perhaps a world where online dating is even simpler (and shallower) than swiping left or right on Tinder.

Have a thing for Brad Pitt circa 1998? Here are the 500 profiles that look the most like him.

At first we’ll see systems like Google’s FaceNet and Facebook’s aforementioned system (dubbed “DeepFace”) make their way onto those company’s web platforms. They will make it easier, or more automatic, for users to tag photos and search for people, because the algorithms will know who’s in a picture even when they’re not labeled. These types of systems will also make it easier for web companies to analyze their users’ social networks and to assess global trends and celebrity popularity based on who’s appearing in pictures.

Though Google and Facebook’s advances in facial recognition are relatively new, computer systems like this can be found all around us today. They incorporate an artificial intelligence technique called deep learning, which has proven remarkably effective at so-called machine perception tasks such as recognizing objects (by some metrics, machines are now better at this than are people), recognizing voices, and understanding the content of written text.

Aside from Google and Facebook, companies including Microsoft MSFT 0.32% , Baidu, and Yahoo YHOO 0.63% are also investing heavily in deep learning research. The algorithms already power everyday features such as voice control on smartphones, Skype Translate, predictive text-messaging applications, and advanced image-searching. (If you have images uploaded to a Google+ account, go ahead and search them for specific objects.) Spotify and Netflix NFLX -0.82% are investigating deep learning to power smarter media recommendations. PayPal EBAY -0.13% is using it to fight fraud.

1 2 下一页

撰写或查看更多观点, 请打开财富Plus APP

《财富》APP下载

杂志订阅

在社交媒体上找到我们

谷歌推出史上最强人脸识别系统