Original Translation | Live from the Apple Keynote: Create ML, the Deep Learning Feature That Doesn't Seem Very Useful?

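Published 2018-07-25 16:53:05
Introduction: Apple did not release a single piece of hardware at this year's WWDC, which was dubbed “the softest Apple event in history.” Apple released iOS 12 and macOS Mojave, and the way macOS and iOS work together makes for a real productivity tool. Many people, however, overlooked Create ML, a feature Apple introduced for developers. This article covers it in detail.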

At the keynote, Apple introduced a new feature for developers: Create ML.





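The point of Create ML is to let you do machine learning on your own laptop. As presented, you drag your data onto the interface, tweak a few settings, and, if you are on a maxed-out iMac Pro, have a trained model ready in as little as 20 minutes. It also compresses the model so you can include it in an app more easily (a feature that appears to be already included in Apple's ML tools). This is mainly possible because it applies Apple's own vision and language models rather than building new ones from scratch.

In practice, though, the quality of a model depends in large part on the nature, arrangement and precision of the “layers” of the training network, and on how long it is trained. Suppose, as a made-up figure, that one hour of training on a MacBook Pro amounts to 10 teraflop-hours of training. If you send the data to the cloud, you can either split those 10 teraflop-hours across 10 computers and get the same result in six minutes, or train for the full hour and get 100 teraflop-hours of training, which will almost certainly produce a better model.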






Apple’s Create ML is a nice feature with an unclear purpose

Apple announced a new feature for developers today called Create ML. Because machine learning is a commonly used tool in the developer kit these days, it makes sense that Apple would want to improve the process. But what it has here, essentially local training, doesn’t seem particularly useful.

The most important step in the creation of a machine learning model, like one that detects faces or turns speech into text, is the “training.” That’s when the computer is chugging through reams of data like photos or audio and establishing correlations between the input (a voice) and the desired output (distinct words).
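
To make “establishing correlations between the input and the desired output” concrete, here is a minimal sketch of a training loop in Python. It is a toy, not how Create ML or any production system trains: the data, the target rule (y = 2x + 1), the function name `train` and the hyperparameters are all assumptions made up for illustration. The model starts with no knowledge and repeatedly nudges two numbers until its outputs match the examples.

```python
# Toy "training" loop: learn w and b so that w * x + b matches the desired output.
# The data and the target rule (y = 2x + 1) are invented for illustration.
data = [(float(x), 2.0 * x + 1.0) for x in range(-5, 6)]


def train(samples, epochs=1000, learning_rate=0.01):
    """Fit y ≈ w * x + b by gradient descent on mean squared error."""
    w, b = 0.0, 0.0
    n = len(samples)
    for _ in range(epochs):
        # Gradient of the mean squared error with respect to w and b.
        grad_w = sum(2.0 * (w * x + b - y) * x for x, y in samples) / n
        grad_b = sum(2.0 * (w * x + b - y) for x, y in samples) / n
        # Step against the gradient to reduce the error a little.
        w -= learning_rate * grad_w
        b -= learning_rate * grad_b
    return w, b


w, b = train(data)
print(f"learned w={w:.3f}, b={b:.3f}")
```

A real vision or speech model does the same kind of thing with millions of parameters and far more data, which is where the computing cost comes from.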

This part of the process is extremely compute-intensive, though. It generally requires orders of magnitude more computing power (and often storage) than you have sitting on your desk. Think of it like the difference between rendering a 3D game like Overwatch and rendering a Pixar film. You could do it on your laptop, but it would take hours or days for your measly four-core Intel processor and onboard GPU to handle.

That’s why training is usually done “in the cloud,” which is to say, on other people’s computers set up specifically for the task, equipped with banks of GPUs and special AI-inclined hardware.

Create ML is all about doing it on your own PC, though: as briefly shown onstage, you drag your data onto the interface, tweak some stuff and you can have a model ready to go in as little as 20 minutes if you’re on a maxed-out iMac Pro. It also compresses the model so you can more easily include it in apps (a feature already included in Apple ML tools, if I remember correctly). This is mainly possible because it’s applying Apple’s own vision and language models, not building new ones from scratch.

The quality of a model depends in great part on the nature, arrangement and precision of the “layers” of the training network, and how long it’s been given to cook. Given an hour of real time, a model trained on a MacBook Pro will have, let’s just make up a number, 10 teraflop-hours of training done. If you send that data to the cloud, you could choose to either have those 10 teraflop-hours split between 10 computers and have the same results in six minutes, or after an hour it could have 100 teraflop-hours of training, almost certainly resulting in a better model.
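
The arithmetic in that paragraph can be sketched in a few lines of Python. The 10-teraflop laptop is the article's made-up number, and the sketch additionally assumes that every cloud machine is exactly as fast as the laptop and that work splits perfectly across machines with no overhead, which real distributed training does not achieve. The function names are hypothetical.

```python
def work_done_tflop_hours(hours, machines, tflops_per_machine):
    """Total training work, assuming perfect linear scaling across machines."""
    return hours * machines * tflops_per_machine


def training_time_hours(work_tflop_hours, machines, tflops_per_machine):
    """Wall-clock hours needed to finish a fixed amount of training work."""
    return work_tflop_hours / (machines * tflops_per_machine)


LAPTOP_TFLOPS = 10.0  # made-up figure, as in the article

# One hour on a single laptop.
laptop_work = work_done_tflop_hours(1.0, 1, LAPTOP_TFLOPS)

# Option 1: the same amount of work, split across 10 equally fast machines.
cloud_minutes = training_time_hours(laptop_work, 10, LAPTOP_TFLOPS) * 60

# Option 2: keep the full hour and let all 10 machines work throughout.
cloud_work = work_done_tflop_hours(1.0, 10, LAPTOP_TFLOPS)

print(f"laptop, 1 hour: {laptop_work:.0f} teraflop-hours")
print(f"10 machines, same work: {cloud_minutes:.0f} minutes")
print(f"10 machines, 1 hour: {cloud_work:.0f} teraflop-hours")
```

Under those assumptions the two cloud options are the same trade seen from two sides: fix the work and cut the time tenfold, or fix the time and do ten times the work.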

That kind of flexibility is one of the core conveniences of computing as a service, and why so much of the world runs on cloud platforms like AWS and Azure, and soon dedicated AI processing services like Lobe.

My colleagues suggested that people who are dealing with sensitive data in their models, for example medical history or x-rays, wouldn’t want to put that data in the cloud. But I don’t think that single developers with little or no access to cloud training services are the kind that are likely, or even allowed, to have access to privileged data like that. If you have a hard drive loaded with the PET scans of 500,000 people, that seems like a catastrophic failure waiting to happen. So access control is the name of the game, and private data is stored centrally.

Research organizations, hospitals and universities have partnerships with cloud services and perhaps even their own dedicated computing clusters for things like this. After all, they also need to collaborate, be audited and so on. Their requirements are also almost certainly different from, and more demanding than, Apple’s off-the-shelf stuff.

I guess I sound like I’m ragging for no reason on a tool that some will find useful. But the way Apple framed it made it sound like anyone can just switch over from a major training service to their own laptop easily and get the same results. That’s just not true. Perhaps as the platform diversifies developers will find ways to make it useful, but for now it feels like a feature without a purpose.



