Loading [MathJax]/jax/output/CommonHTML/config.js
前往小程序,Get更优阅读体验!
立即前往
首页
学习
活动
专区
圈层
工具
发布
首页
学习
活动
专区
圈层
工具
MCP广场
社区首页 >专栏 >访谈 - Sensory CEO Todd Mozer与FindBiometrics CEO Peter O'Neil

访谈 - Sensory CEO Todd Mozer与FindBiometrics CEO Peter O'Neil

作者头像
用户6026865
发布于 2020-01-17 10:10:22
发布于 2020-01-17 10:10:22
4370
举报

Sensory CEO Todd Mozer近日接受了FindBiometrics CEO Peter O'Neil的专访。内容包括了 Sensory于2019年对Vocalize.ai,独立第三方语音和声音生物特征测试实验室的收购,以及包含语音识别和交互,面部识别和模拟的人的虚拟化身(virtual avatar)的应用,以及关于当但隐私保护的探讨等等。

关于Samsung NEON 人工智能Avatar(虚拟人)的视频请看这里 -

http://mpvideo.qpic.cn/0b78baaaeaaas4acznaygfpfacgdaieaaaqa.f10002.mp4?dis_k=04bcd25358811f8edeb6b4e66d2d1e7d&dis_t=1579255784

关于Sensory嵌入式语音和面部生物识别在银行虚拟Avatar助理的应用请看这里 - http://mpvideo.qpic.cn/tjg_3877080668_50000_6e57ba9fe5e64e0fb359645be80473b5.f10002.mp4?dis_k=df0b4b928b695a2f25496aa31e45a25b&dis_t=1579255784

以下为访谈内容 -

Peter O’Neill, President, FindBiometrics: I’d like to start off by talking with you about this past year. It’s been a busy one for our industry. Can you tell us about some of the highlights with regard to your company? Todd Mozer, CEO, Sensory: Sure. It’s been a busy year, and the industry is obviously taking off. There’s a lot of dynamism going on in the industry as well as for Sensory. One of the big things Sensory did in 2019 was acquiring a company called Vocalize, which is a testing house for speech and voice biometric products. We have always done our own in-house testing, where we do a lot of recorded voices that are digitally recognized within a computer where we can model different kind of noise environments and conditions like distances, echoes, reverbs, car noise, people talking, etc. We would do statistically significant volumes of testing to recommend appropriate search points and accuracy assessments. However we found that when our customers were doing testing through speakers and through live audio channels, their results were not always correlated with what we were getting in-house. And so, we started working with Vocalize, and after about six months we knew we wanted to continue with them, and we became their biggest customer. It was really useful to test both digitally and through live audio channels. One thing led to another and we ended up just acquiring them. But they remain independently operated — but if they are working with a Sensory customer we offer a nice discount on their standard pricing. Vocalize follows ANSI and Google/Amazon testing standards and some more traditional approaches to testing, so it provides our customers, or anybody, an independent source for testing, which for Sensory is really valuable. Peter O’Neill, President, FindBiometrics: I know you are constantly, continuously improving your face and voice products, and actually have been a world leader in machine learning and neural nets for years. I think you were probably the first person I ever spoke to in this industry that started to educate me about those areas, but can you tell us what’s the latest? Todd Mozer, CEO, Sensory: The whole industry has really moved towards deep learning in a variety of approaches using neural nets, and as you say, we’ve been doing it for a long time. One of the interesting things that’s been happening is more and more specialized chips are appearing, and because we do everything on-device or on the edge, and embedded is are our focus, this creates some interesting opportunities. More and more chips are emerging that specialize in running these neural net functions on-device in lower power and lower cost platforms. And we’ve started experimenting in porting to some of these platforms.

In 2019, for example, we announced that we partnered with Gyrfalcon, which has a very nice, efficient AI accelerator, and it requires a custom net on their deep learning architecture and we ported and quantized to their neural net. It took a bit of work, but once we got there, what we found is that we could run really, really powerful models much more efficiently than we ever could in something like an Android OS. We are also moving to a Syntiant chip which enables ultra-low power wake words and commands. These chips have been a nice development on the deep learning side. There’s a bit of an irony because Sensory’s first product, the RSC-164 was a low power microcomputer with a specialized neural net processor on chip. Peter O’Neill, President, FindBiometrics: And in terms of your product portfolio, can you give our readers a little bit of an update as to what the latest is? Todd Mozer, CEO, Sensory: Our roots are in speech recognition, and with the speech recognition side we do wake words and we do small vocabulary command and control that’s extremely robust to noise, and we can do large vocabulary, continuous speech recognition, and things that feel like dictation. We also have some really nice NLU engines, and have three flavors of NLU that vary in size, so we can do a relatively small footprint solution with NLU. We can do broad domain language models. We’re doing a lot more domain-specific assistant type functions and we have added a new technique for multi-wake words to support Amazons Voice Interoperability Initiative. That’s what we’re doing on the speech recognition side. We’re also doing biometric voice where we can do text dependent or text independent speaker verification. On the computer vision side of things, we have the ability to do biometrics on a person’s face, where we have very good anti-spoofing technology. We have started to use the camera and we’re looking at the face to detect expressions and to understand demographics of the user. So we’re able to build a very interesting user profile combining face and voice together that gives us a lot of data for analytics of the customer, which can help the user in getting what they want faster and help the sellers in bringing them the things that they really want so that they can sell more and sell faster.

We’re finding this is a very interesting use case, and we’re putting it all together because we have all these different technologies that can have a common interface with an avatar that you talk to and it talks back to you. In essence, it’s kind of like a shopping assistant or a purchasing assistant that can help a person get what they want and help the entities that they’re interacting with better understand what they want without the person even saying it.

Peter O’Neill, President, FindBiometrics: And would the target market in that area Todd, be for the marketing and advertising folks? Is that for retail? Can you give me an example of how that would actually be used?

Todd Mozer, CEO, Sensory: Conceptually it’s for anybody that wants to sell something, but the initial demonstrations and the initial places that we’re targeting are quick service restaurants and large retail shops. There’s a millennial-driven phenomenon where people like to order ahead, and this is going on with restaurant food, groceries and more. The idea being that you call in with an app and fill out what you want and then you can show up and pick it up. And we take it one step further, where you can use your voice to do all the ordering and all the interfacing with the automated selling agent. And they get the benefit of all the data analytics that goes with a device that you’re talking to, that hears you and sees you while you’re communicating.

Sensory Shopping Assistant Avatar for coffee shop请看这里 -

Peter O’Neill, President, FindBiometrics: Very cool. As we head into the next decade, with the current speed of advancement and deployment, we’re seeing a lot go on. What would you say are some of the key advances that you’ve seen, generally, in our industry over the past 10 years? Todd Mozer, CEO, Sensory: Well, over the past 10 years, deep learning’s really taken off and the value of data has emerged and the rise of the assistants. I think the assistant market, which is Google and Amazon and others, is a huge, huge phenomena. They’re really taking over the whole consumer electronic space, and just over the Thanksgiving holiday, Amazon’s top-selling product was one of their speakers. And I had heard that they had priced some of the older Echo Dots at $10, so they’re just doing huge volumes and getting them out there to the market, which means more and more people getting used to interacting with things by voice. This huge growth will drive all these other sensor functions and products, including biometrics and peripheral devices like light switches or thermostats.

One of the phenomenon that’s going on in this market is driven by governments that are saying, “Hey, there’s all this private data that should be kept private, but these guys are taking it.” There’s a lot of fear by consumers about what these companies are doing with their microphones and their cameras and there have been literally dozens of reports coming out over the last year about people outside the company that are listening to transcripts of things they shouldn’t be listening to, in the bedroom and these kinds of things. So, there’s a movement towards privacy, which brings things more towards on-device, which is where Sensory is very focused. Here in California, in just less than a month, we will have new laws in place that protect the consumer, through the California Consumer Privacy Act (CCPA). And already it’s happened in Europe, and we think that’s going to spread pretty quickly across the States. CCPA will be a model for other states to follow.

Peter O’Neill, President, FindBiometrics: Which is not necessarily a bad thing. I guess you’re speaking about GDPR in the European community. If it makes consumers a little bit more confident that their privacy is protected, I think that’s good on all fronts. Todd Mozer, CEO, Sensory: I think that’s right. There’s a value of having these companies have the data, but there’s also a big, big risk. And there have been study after study which says somewhere between 30 percent and 70 percent of users are concerned about the use of their private data.

Peter O’Neill, President, FindBiometrics: Can I follow up a little bit on what you said earlier about voice and how they are being utilized now? I know that you and I have talked for many years about the fact that voice would be utilized as the main communication tool with all the IoT, automotive, robotics, etc… and now that’s happening. What are we going to be looking at in 10 years time when it comes to frictionless travel, robotics, auto and IoT? Todd Mozer, CEO, Sensory: I think it’s just going to get better and better. Just in the last couple of years, it’s amazing how much the smart speakers have advanced in their capabilities. When they first came out, basically they were music players and you could set alarms, and now you can actually ask a variety of questions and I’d say it gets them right about half the time. And as we go forward in time, it’s going to move towards, they will get more complex questions answered more of the time. We’ll be able to have more dialogue-oriented interactions rather than these kind of momentary pieces of time, and I suspect we will see more proactive Assistant interfaces rather than just reactive.

They’ll know more and more about us, for better or worse, which in a utopian world means that they can help us more and more seamlessly. In a dystopian world, things can get a little creepy, so it’ll be interesting to see how those things play out. But I’m pretty much a believer in the fact that AI is not good or evil and it’s a matter of how it’s deployed. And I think it can be deployed to a whole lot of really, really good benefits, and we just need to be careful about the possible downsides.

Peter O’Neill, President, FindBiometrics: Right, and I think education is critical in that area. We’ve certainly been feeling that in our business. We’re constantly asked questions about these issues.

What we can expect to see from Sensory in the coming few years? Todd Mozer, CEO, Sensory: Because we have so many different technologies, and actually I didn’t even talk about our ability to listen to scenes and sounds and identify what they are, but because we have this wide range of very powerful technologies that can run on-device, we’re combining them more and more together into applications that take advantage of multiple things in parallel. So, we’re going to see more fusion of technologies. We’re doing a lot of work right now with face and voice fusion. And if you think about some of the things that I mentioned, like whether it’s demographics or other kinds of things, they’re really hard to do just looking at a person’s face or just listening to their voice, but when you combine them together, you get added insights.

Right now we’ve done them discretely and done an algorithmic approach to combining face and voice data, and what we’re looking at doing is more of a deep learning fusion, where we’re looking at the face and the voice in parallel to detect different kinds of things that are going on. This could include improving speech recognition and becoming more robust to outside noise. If you can watch a person’s lips while they’re speaking, then you can disregard other people that are talking in the same spectrum. Humans are very, very good at that. We call it the cocktail party effect, and machines can be good at that too, and so we’re working on those sort of things.

Peter O’Neill, President, FindBiometrics: Well, how exciting is that? I love talking with you because you’re on the cutting edge of our industry, and always a pleasure to hear your thoughts as we continue to move rapidly forward in our industry. Thank you very much for your time today. Todd Mozer, CEO, Sensory: Thank you, Peter. It’s always a pleasure talking to you. And yeah, we’re on the bleeding edge, which can be good or bad.

本文参与 腾讯云自媒体同步曝光计划,分享自微信公众号。
原始发表:2020-01-11,如有侵权请联系 cloudcommunity@tencent.com 删除

本文分享自 SmellLikeAISpirit 微信公众号,前往查看

如有侵权,请联系 cloudcommunity@tencent.com 删除。

本文参与 腾讯云自媒体同步曝光计划  ,欢迎热爱写作的你一起参与!

评论
登录后参与评论
暂无评论
推荐阅读
编辑精选文章
换一批
专访 - Sensory CEO Todd Mozer - AI, 3D人脸识别以及其他
Sensory Inc.作为向全球移动设备提供先进的复杂生物识别算法的供应商,于近期展示了其采用面部和声音识别算法的AI虚拟银行助理技术。
用户6026865
2019/10/30
8150
专访 - Sensory CEO Todd Mozer - AI, 3D人脸识别以及其他
ZOOM Release Edge Speech Recognition Powered by Sensory
ZOOM RELEASES EDGE SPEECH RECOGNITION POWERED BY SENSORY
用户6026865
2022/09/02
5690
ZOOM Release Edge Speech Recognition Powered by Sensory
The TOP 44 Leaders in Voice - Sensory CEO荣膺最具远见商业领袖
语音助理(Voice Assistant)已经成为一种现象型产品,已经成为了一种文化符号,成为了继网站,和移动设备之后,的一种新的计算平台。
用户6026865
2019/08/16
4280
The TOP 44 Leaders in Voice - Sensory CEO荣膺最具远见商业领袖
Sensory为全球的第三方设备提供Hey Siri唤醒词
Sensory宣布其TrulyHandsFree - 面向边缘侧设备端的唤醒词和语音识别引擎(edge-based wake-word and phrase recognition engine),面向全球不同国家,推出"Hey Siri”唤醒词。
用户6026865
2021/07/08
7580
Sensory为全球的第三方设备提供Hey Siri唤醒词
GETTING RID OF WAKE WORDS…PLEASE NOT YET!
One of the great things about Sensory is the traction we have had over the years. Not just traction that produces revenues and profits, but traction that gives us insights into what hundreds of multibillion-dollar companies want in their speech solutions. Since Sensory introduced the first commercially successful voice triggers aka wake word that called up a voice assistant (e.g. Samsung Galaxy S2 and MotoX), we have been getting requests for the same thing:
用户6026865
2022/05/17
6760
GETTING RID OF WAKE WORDS…PLEASE NOT YET!
Sensory's Take on Generative AI
Conversations about Large Language Models (LLMs) were once confined to the domain of speech techies, but now it’s gone mainstream.
用户6026865
2023/03/02
2710
Sensory's Take on Generative AI
Buy Now Pay Later, But At What Price? A Case for Face Biometrics
The speed at which we can buy and receive products has escalated with the rise of one click internet shopping, the gig economy for deliveries, and economies of scale through improved logistics, warehousing, and deliveries. The rise of robotic and drone deliveries is going to make it all the easier to get stuff fast. Helping the speed of buying is having our purchasing information stored on our computers and our phones. Buy Now Pay Later (BNPL) makes it all the faster because now we don’t even need to have the cash assets to buy things.
用户6026865
2022/05/17
2890
Buy Now Pay Later, But At What Price? A Case for Face Biometrics
Assessing Biometric Authentication -A Holistic Approach
Biometric authentication is certainly starting to get the attention of the general public. Announcements like the revelation this past fall that over 1 billion stolen passwords had been amassed by a Russian crime ring underscore the fact that the current security systems are flawed, and that new approaches to security are necessary. There is a growing consensus in government and industry (and often confirmed in Hollywood) that biometric approaches are the best path forward. The push by Apple and Samsung to make fingerprint authentication available in their devices is among the most visible applications of biometrics.
用户6026865
2022/09/02
3180
Assessing Biometric Authentication -A Holistic Approach
STM&Sensory Enable Embedded VUI Through STM32Cube Ecosystem
TM32 MCUs pair with Sensory’s VoiceHub technology to streamline development of voice-based user interfaces on wearables, IoT, and smart-home applications
用户6026865
2022/09/02
4310
16位顶级数据科学家语录
Chief Data Scientist at The New York Times & Associate Professor of Applied Mathematics at Columbia University
哒呵呵
2018/08/06
5730
How to Keep Up-to-Date as a Web Developer?
uptodate.jpg Stay Up To Date As A Software Developer. How to update yourself as a web/programming de
用户4822892
2019/11/12
4470
How to Keep Up-to-Date as a Web Developer?
萨提亚·纳德拉与沈向洋CVPR对谈:那些未来可期的计算机视觉研究与应用
编者按:6月16日,CVPR 2020 大会以全球连线的形式如期开幕。在大会的首场主题演讲中,微软公司 CEO 萨提亚·纳德拉与微软公司前执行副总裁沈向洋进行了一场精彩的炉边对谈,分享了对计算机视觉、人工智能研究与应用前景的思考与展望。本文为大家整理了完整的文字实录。
CV君
2020/06/28
6000
萨提亚·纳德拉与沈向洋CVPR对谈:那些未来可期的计算机视觉研究与应用
How AHI Fintech and DataVisor are Securing Data through AI and Big Data
The field of financial risk control has recently seen a sudden increase in competition over the past year. Several budding enterprises find themselves currently fighting a battle on two fronts—data acquisition capabilities and algorithm technology.
全栈程序员站长
2022/07/21
3830
Develop Custom VUI's for Children's Speech
Developers can now access child speech models, as well as Sensory’s industry-leading adult speech models, within Sensory’s VoiceHub developer portal.
用户6026865
2022/05/17
3210
Develop Custom VUI's for Children's Speech
I Gave Up Instagram and 30,000 Followers
平时有很多碎片化时间,比如下班的地铁上,或者等待的时间,我们总喜欢拿出手机玩,这个时间也可以用来学习呢,当然佳爷自己也想学习英语,所以上下班的时间看看。
仇诺伊
2020/05/29
4620
追求卓越,勇攀高峰 - RWP中国之旅盛大来袭
编辑手记:3月28日,Oracle RWP 性能之旅,北京站再度来袭!Andrew Holdsworth 和 Graham Wood 将带领大家在一天之内,探秘 OLTP 并分享现实世界中性能表现及监控案例,这是继2015年8月 Tom 大师归隐之后,RWP 团队第一次公开亮相! 我相信每一个技术爱好者都对此次的RWP中国之旅充满了期待,而我们有幸邀请到RWP团队全球副总裁Andrew Holdsworth在会前跟大家做个简单交流互动。 Andrew Holdsworth Oracle Real
数据和云
2018/03/07
7590
追求卓越,勇攀高峰 - RWP中国之旅盛大来袭
Introducing SensoryCloud.ai: Flexibility
After a quarter century of running embedded or “on the Edge” Sensory is moving into the cloud with the opportunity to offer hybrid solutions with more Flexibility, Accuracy, Features/Technologies, Privacy and Cost advantages than ever before.
用户6026865
2022/04/02
2230
Introducing SensoryCloud.ai:  Flexibility
Voice Assistants…What’s Going On
Amazon is on track to lose $10B on its devices group, which includes Alexa, and massive layoffs have been announced targeting the Alexa team. Google Assistant Actions and Driving Mode have been shut down amidst rumors of layoffs and re-prioritizing the Google Assistant and AI functions to make their in-house hardware better.
用户6026865
2023/03/03
3580
Voice Assistants…What’s Going On
Sensory&Philips-Enhance ASR with Speech Enhancement
Sensory, a Silicon Valley company enhancing user experience and security for consumer electronics, announced today its collaboration with Philips, a provider of advanced speech enhancement technologies, to offer a combined technology suite. This would package Sensory’s best-in-class speech recognition technologies TrulyHandsfree™ and TrulyNatural™ with Philips BeClear Speech Enhancement™ algorithms, resulting in significant accuracy improvement in noisy environments. By processing an audio signal with Philips’ echo cancellation, noise suppression and/or beam-forming processors before passing it to Sensory’s speech recognition engine, much of the unwanted ambient noise in a signal can be filtered out, leaving the critical speech portion of the signal largely untouched. This process allows Sensory’s already noise robust speech recognizer to decipher near- and far-field speech more accurately in conditions where very high ambient noise is present.
用户6026865
2022/09/02
5050
Sensory&Philips-Enhance ASR with Speech Enhancement
饮食行业的Voice-First变革
原文链接如下 - https://www.qsrmagazine.com/outside-insights/voice-first-revolution-takes-shape-restaurants
用户6026865
2019/09/16
6230
饮食行业的Voice-First变革
推荐阅读
相关推荐
专访 - Sensory CEO Todd Mozer - AI, 3D人脸识别以及其他
更多 >
领券
问题归档专栏文章快讯文章归档关键词归档开发者手册归档开发者手册 Section 归档