Loading [MathJax]/jax/output/CommonHTML/config.js
前往小程序,Get更优阅读体验!
立即前往
首页
学习
活动
专区
圈层
工具
发布
首页
学习
活动
专区
圈层
工具
MCP广场
社区首页 >专栏 >Sensory's VOICE BIOMETRIC REVOLUTION

Sensory's VOICE BIOMETRIC REVOLUTION

作者头像
用户6026865
发布于 2023-03-02 13:21:05
发布于 2023-03-02 13:21:05
3060
举报

- VOICE BIOMETRIC REVOLUTION: -

WHY VOICE ID IS NOW SECURE ENOUGHFOR DEVICE UNLOCK

INTRODUCTION

With the ever-increasing reliance on smartphones, laptops, and smart IoT devices comes a growing need to protect personal data and applications.

Consequently, our devices must implement suitably strong measures to guard against fraudulent access. Until now, the security standard for unlocking a mobile device or laptop exceeded what was possible to achieve with spoken voice, relegating voice to a useful convenience for some functions but not sufficient for completely unlocking a device with full access at the same level as a 4-digit PIN, a fingerprint, or a face match.

However, there are numerous circumstances where a user needs hands-free access and cannot touch or look at the device directly.

Examples include driving, cooking, exercising, or even working in an environment that requires gloves and other personal protective equipment. And, there are several cases where voice is the only means of interaction. Therefore, using spoken voice to unlock a device and enable a full set of voice commands is highly desirable.

DEVICE UNLOCK SECURITY OVERVIEW

There are different ways to protect a personal device against fraudulent access.

The most common are PIN codes and biometric-based methods such as fingerprints and facial recognition.

The probability of accepting an impostor with a 4-digit PIN code is 1 in 10,000.

The accuracy requirements when using biometrics for mobile device unlock are similarly high.

The Android Compatibility Definition Document (CDD) requires the false acceptance rate of fraudsters to not exceed 1 in 50,000 with a maximum false reject rate of 10%, as well as requiring support for spoofing detection.

An example of a biometric that meets or exceeds this standard is face matching. The technology that enables Apple’s Face ID utilizes advanced hardware and software. The iPhone camera creates a depth map of yourface while also capturing an infrared image, while Apple puts the probability of a random person looking at your iPhone and unlocking it using Face ID at approximately 1 in 1,000,000.

DISADVANTAGES OF CURRENT SOLUTIONS

While face and fingerprint biometrics offer strong device-based security, there are cases when a user would benefit from hands-free access to their locked device.

An obvious example is while driving. In this case, typing PIN or using facial or fingerprint biometrics is not safe as require the driver to interact with the device’s touch screen or to position their face in front of the camera, diverting attention from the task of driving.

Voice biometrics offer a passive interaction without diversion. The user can unlock the device and perform a task with a short spoken phrase such as «Ok, Google, read my last text message.»

However, the advantages of offering a voice-based option extend beyond the need for hands-free access for safety.

First, unlike alternative methods of unlocking a device, only voice offers the ability to unlock a secure device and execute a command such as «read my last text message», in one simple step.

Second, the user environment may be better suited to voice, such as in the case of poor lighting conditions for face capture and wet or dirty fingers for fingerprint capture. Voice now provides a secure and convenient alternative.

Finally, unlike other biometric modalities, voice biometrics doesn’t require sophisticated-device-specific fingerprint or camera sensors and will work even with low-end devices.

Voice unlock can now be used, with billions of existing devices without additional hardware costs. The net result is that voice extends the value of the device to more situations.

THE SENSORY SOLUTIONS

The scientific community has recently made advancements in the voice recognition space, more precisely called the speaker verification space.

The voice modality offers a desirable approach to handling user authentication for device unlock, payments, and other activities that require high security.

However, there are currently no commercially available solutions on the market that enable secure device unlock using voice. Until now, the accuracy of voice biometrics has not met accepted standards.

Sensory made significant progress in the use of voice biometrics for device unlock use cases. Sensory accomplished this breakthrough by combining its advanced algorithms with multiple speaker verification methods and its unique voice anti-spoofing technology. The approach is described in the following paragraphs.

The main methods for authenticating a person’s unique voiceprint can be divided into two categories: text-dependent and text-independent.

In a text-dependent approach, the analyzed phrases are fixed and known beforehand. Conversely, the text-independent approach places no constraints on the words which the user is allowed to speak for authentication.

Both approaches have pros and cons. But unique capabilities arise when combining the two approaches in a natural voice user interface interaction.

For instance, voice interactions with personal electronic devices commonly start with a fixed wake-up word such as «Ok, Google», «Hi, Alexa», or «Hey, Siri».

A wake-up word alerts the device to listen for a command phrase. The actual command phrase is not fixed and thus will not be handled by the text-dependent approach. The command phrase could be something like: «What time is my next meeting?» or «Venmo 10 dollars to Alex for lunch».

The proposed solution applies text-dependent speaker recognition for the wake-up word, text-independent speaker recognition for the command/question, and voice anti-spoofing technology for the entire utterance.

The matching results are combined to provide an authentication decision. The voice anti-spoofing algorithm protects the system from spoofing attacks. The types of voice spoofing attacks covered are:

The combination of speaker verification methods and anti-spoofing results in a high level of authentication accuracy with a False Acceptance Rate below 1 in 50,000, a False Rejection Rate below 10%, and a Spoofing Acceptance Rate as low as 3%.

The uniqueness of the technology lies in using a Common Deep Neural Network processing step that enables the extraction of robust features from the voice for text-dependent, text-independent, and anti-spoofing within a single network.

Combining datasets that were previously used separately for these tasks doubles the training set and enables a synergistic effect for each task and the authentication task in particular.

DIVERSITY OF EVALUATION DATA

Sensory pays significant attention to the diversity of data and is guided by industry standards.

Our evaluation methodology and accuracy metrics were defined according to the ISO standards: ISO/IEC 19795-1 (biometric performance testing and reporting), ISO/IEC 30107-3 (biometric presentation attack detection), and ISO/IEC TR19795-3 (biometric performance testing and reporting).

The following principle factors are accurately measured and taken into account:- Biological factors: age distribution, gender-Social factors: language- Environmental factors: noise level and type, transmission channel, reverberation. Additionally, one more biological factor is taken into account - inter-day voice variability.

This is the variability of invoice characteristics caused by changes in a user’s emotional and physical state across different days.

Sensory's data comprises up to five different data sources: data collection services with fully controlled conditions of data gathering, individual subcontractors, crowd collection services with uncontrolled conditions,our data collection department, and data from partners.

There are 10 different languages, including European and Asian languages, 10k speakers, 5 environments, and near-, mid- and far-field conditions.

EVALUATION RESULTS

As with all biometric authentication systems, measuring the two types of errors, the False Accept Rate (FAR), the error of letting an impostor through, and the False Reject Rate (FRR), the error of blocking a valid person, constitute the basis for measuring accuracy.

A Detection Error Trade-off (DET) plot illustrates the trade-off between these two types of errors for a biometric matching system.

CONCLUSION

e, voice biometrics open up new possibilities for secure, hands-free access to information and applications on a variety of voice-enabled devices Examples of use cases for hands-free voice biometric unlock:

本文参与 腾讯云自媒体同步曝光计划,分享自微信公众号。
原始发表:2023-02-21,如有侵权请联系 cloudcommunity@tencent.com 删除

本文分享自 SmellLikeAISpirit 微信公众号,前往查看

如有侵权,请联系 cloudcommunity@tencent.com 删除。

本文参与 腾讯云自媒体同步曝光计划  ,欢迎热爱写作的你一起参与!

评论
登录后参与评论
暂无评论
推荐阅读
编辑精选文章
换一批
Assessing Biometric Authentication -A Holistic Approach
Biometric authentication is certainly starting to get the attention of the general public. Announcements like the revelation this past fall that over 1 billion stolen passwords had been amassed by a Russian crime ring underscore the fact that the current security systems are flawed, and that new approaches to security are necessary. There is a growing consensus in government and industry (and often confirmed in Hollywood) that biometric approaches are the best path forward. The push by Apple and Samsung to make fingerprint authentication available in their devices is among the most visible applications of biometrics.
用户6026865
2022/09/02
3090
Assessing Biometric Authentication -A Holistic Approach
Voice ID On-device - Embedded Secure Authentication Solution
Easy, Embedded and Secure Voice Biometric Authentication for Devices and Applications
用户6026865
2022/09/02
5100
Voice ID On-device - Embedded Secure  Authentication Solution
访谈 - Sensory CEO Todd Mozer与FindBiometrics CEO Peter O'Neil
Sensory CEO Todd Mozer近日接受了FindBiometrics CEO Peter O'Neil的专访。内容包括了 Sensory于2019年对Vocalize.ai,独立第三方语音和声音生物特征测试实验室的收购,以及包含语音识别和交互,面部识别和模拟的人的虚拟化身(virtual avatar)的应用,以及关于当但隐私保护的探讨等等。
用户6026865
2020/01/17
4330
Sensory TrulySecure - Easy, Embedded, Secure Authentication
Sensory TrulySecure Speaker Verification(TSSV)技术是独立于语言的(language independent),具备高度安全性和便利性的,设备端(on device)用户语音和短语(passphrase)验证技术。
用户6026865
2021/03/15
4650
Sensory TSSV - TrulySecureSpeakerVerificatio
TSSV-面向硬件设备和应用的嵌入式的和简单的安全验证(Secure Authentication)技术。
用户6026865
2020/08/17
6600
Sensory  TSSV - TrulySecureSpeakerVerificatio
SensoryCloud AI - 支持Liveness的声纹生物特征识别
Biometric data is the unique information that can be used to identify a person with accuracy. It includes uniquely identifiable features such as fingerprint, face recognition, iris, voice recognition. The increased acceptance of biometrics by consumers has encouraged the uptake of these systems on a wider scale.
用户6026865
2022/05/17
4430
专访 - Sensory CEO Todd Mozer - AI, 3D人脸识别以及其他
Sensory Inc.作为向全球移动设备提供先进的复杂生物识别算法的供应商,于近期展示了其采用面部和声音识别算法的AI虚拟银行助理技术。
用户6026865
2019/10/30
8130
专访 - Sensory CEO Todd Mozer - AI, 3D人脸识别以及其他
Sensory’s TrulyHandsfree and Arm’sCortex-M55
Efficient wake word recognition on microcontrollers with Cortex-M55 and Helium technology for use in consumer and automotive products that include more and more AI features for voice applications.
用户6026865
2022/09/02
3410
Sensory’s TrulyHandsfree and Arm’sCortex-M55
5 Predictions for Voice Technology in 2023
There is no doubt that voice is the most natural and convenient communication mode, so it's little wonder that the adoption of voice technology on smart devices has more recently become the preferred interface in many contexts.
用户6026865
2023/03/02
2970
5 Predictions for Voice Technology in 2023
Buy Now Pay Later, But At What Price? A Case for Face Biometrics
The speed at which we can buy and receive products has escalated with the rise of one click internet shopping, the gig economy for deliveries, and economies of scale through improved logistics, warehousing, and deliveries. The rise of robotic and drone deliveries is going to make it all the easier to get stuff fast. Helping the speed of buying is having our purchasing information stored on our computers and our phones. Buy Now Pay Later (BNPL) makes it all the faster because now we don’t even need to have the cash assets to buy things.
用户6026865
2022/05/17
2890
Buy Now Pay Later, But At What Price? A Case for Face Biometrics
DJI和GoPro运动相机语音控制对比和语音控制技术和创新应用的探讨
作为运动相机,必须要满足运动场景下的HANDS-FREE解放双手的操作,而语音则以用户最自然的方式,赋予用户直观,强大和自然的人机交互方式。
用户6026865
2020/09/29
1.7K0
DJI和GoPro运动相机语音控制对比和语音控制技术和创新应用的探讨
疫情期间戴口罩仍可识别的Sensory Biometric面部识别解决技术
Sensory TrulySecure人声和面部生物识别技术(face and voice biometrics)为用户带来极大的便利性,同时为用户在COVID-19新常态期间带来新价值 - 用户带口罩仍可正常识别,而且可以识别咳嗽和打喷嚏(cough and sneezes)。
用户6026865
2020/06/12
6930
疫情期间戴口罩仍可识别的Sensory Biometric面部识别解决技术
多模态PCANet:一种高精度、低复杂度的鲁棒3D活体检测方案
当下正值新冠肺炎(COVID-19)肆虐全球之际,戴口罩成为了全民阻断病毒传播的最佳方式。然而在人脸部分遮挡或恶劣光照条件下,用户人脸识别或人脸认证的合法访问常常提示活体检测失败,甚至根本检测不到人脸。这是由于目前基于RGB等2D空间的主流活体检测方案未考虑光照、遮挡等干扰因素对于检测的影响,而且存在计算量大的缺点。而数迹智能团队研发的3D SmartToF活体检测方案则可以有效解决此问题。那么什么是活体检测?什么又是3D活体检测?以及怎么实现恶劣环境(如人脸遮挡、恶劣光照等)与人脸多姿态变化(如侧脸、表情等)应用场景下的活体检测呢?本文将会围绕这些问题,介绍数迹智能的最新成果——基于ToF的3D活体检测算法。
3D视觉工坊
2020/11/11
1.5K0
多模态PCANet:一种高精度、低复杂度的鲁棒3D活体检测方案
Sensory生物识别技术 - 更安全,更便捷,最具成本优势
生物身份识别和验证技术讲究的是在易用性和识别准确性之间的平衡(conbination of convenience and accuracy)。
用户6026865
2020/12/14
5940
Sensory生物识别技术 - 更安全,更便捷,最具成本优势
Sensory&Philips-Enhance ASR with Speech Enhancement
Sensory, a Silicon Valley company enhancing user experience and security for consumer electronics, announced today its collaboration with Philips, a provider of advanced speech enhancement technologies, to offer a combined technology suite. This would package Sensory’s best-in-class speech recognition technologies TrulyHandsfree™ and TrulyNatural™ with Philips BeClear Speech Enhancement™ algorithms, resulting in significant accuracy improvement in noisy environments. By processing an audio signal with Philips’ echo cancellation, noise suppression and/or beam-forming processors before passing it to Sensory’s speech recognition engine, much of the unwanted ambient noise in a signal can be filtered out, leaving the critical speech portion of the signal largely untouched. This process allows Sensory’s already noise robust speech recognizer to decipher near- and far-field speech more accurately in conditions where very high ambient noise is present.
用户6026865
2022/09/02
5000
Sensory&Philips-Enhance ASR with Speech Enhancement
ST&Sensory&DSPC Joint Webiner
Customizable embedded voice recognition solutions without external connectivity
用户6026865
2023/03/02
3850
ST&Sensory&DSPC Joint Webiner
Introducing SensoryCloud.ai: Flexibility
After a quarter century of running embedded or “on the Edge” Sensory is moving into the cloud with the opportunity to offer hybrid solutions with more Flexibility, Accuracy, Features/Technologies, Privacy and Cost advantages than ever before.
用户6026865
2022/04/02
2170
Introducing SensoryCloud.ai:  Flexibility
CV学习笔记(二十八):活体检测总结②
和传统的方法结构类似,只是使用了VGG进行特征提取,通过CNN网络端到端学习anti-spoofing的表示空间
云时之间
2020/07/22
1.3K0
Anti-Spoofing之人脸活体检测
每周精选 Algorithm System Anti-Spoofing 之人脸活体检测 在小编之前的文章系列中曾介绍过的对抗样本攻击,是目前Deep Learning比较火热的一个研究方向,因为它掀起了关注深度学习在安全领域潜在问题的热潮。虽然活跃于学术界的对抗样本目前还未渗入到工业界中,anti-spoofing(反欺诈)仍一直是大家关注的焦点。人脸识别是大家最为熟悉的应用深度学习的例子,结合人脸识别技术的APP在市面上比比皆是,本文将简单介绍在人脸识别应用中的反欺诈技术——人脸活体检测。 人脸识别,
企鹅号小编
2018/01/29
5.3K0
重磅纯干货 | 超级赞的语音识别/语音合成经典论文的路线图(1982-2018.5)
网址:https://github.com/zzw922cn/awesome-speech-recognition-speech-synthesis-papers
用户7623498
2020/08/04
1.3K0
推荐阅读
相关推荐
Assessing Biometric Authentication -A Holistic Approach
更多 >
领券
问题归档专栏文章快讯文章归档关键词归档开发者手册归档开发者手册 Section 归档