
Journal of Shandong University (Engineering Science) ›› 2023, Vol. 53 ›› Issue (6): 1-7. doi: 10.6040/j.issn.1672-3961.0.2023.132

• Machine Learning and Data Mining •

  • About the author: LI Jiachun (1968—), female, a native of Songzi, Hubei, China; Ph.D., associate professor, and master's supervisor. Her research interests include computer networks and information security, privacy protection, AI security, and smart education. E-mail: jclee@scut.edu.cn
  • Supported by the Industry-University Collaborative Education Program of the Ministry of Education (201902186007, 201901034001)

An efficient and lightweight RGB frame-level face anti-spoofing model

Jiachun LI, Bowen LI, Jianbo CHANG

  1. School of Computer Science and Engineering, South China University of Technology, Guangzhou 510006, Guangdong, China
  • Received: 2023-06-21  Online: 2023-12-20  Published: 2023-12-19


Abstract:

To address the challenges of deploying deep learning-based face anti-spoofing (FAS) algorithms on general-purpose consumer devices equipped with only an RGB camera, an efficient and lightweight RGB frame-level FAS model (EL-FAS) was proposed. To improve the generalization ability of the model and achieve high detection performance under constrained conditions, a novel global spatial self-attention mechanism was explored to capture global context dependencies, and an equal-channel pixel-wise binary supervision method was designed to force the model to learn shared features from different pixels. The backbone network was built from Bottleneck residual blocks to reduce the number of parameters. Analysis and experimental results showed that the EL-FAS model achieved the lowest average classification error rate on most protocols of the OULU-NPU dataset, obtained competitive performance on the SiW dataset and in cross-dataset tests, and remained lightweight, with only 1.34×10⁶ parameters.
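The equal-channel pixel-wise binary supervision mentioned above trains every spatial position of a predicted score map against the same live/spoof label. The paper's exact formulation is not reproduced on this page; the sketch below only illustrates the general pixel-wise binary-supervision idea (as popularized by DeepPix [10]), and the function name and shapes are illustrative assumptions:

```python
import numpy as np

def pixelwise_binary_loss(pred_map, label):
    """Binary cross-entropy averaged over every pixel of a predicted
    score map. Each spatial position is supervised with the same
    live/spoof label, so all pixels must encode shared liveness cues.

    pred_map : (H, W) array of probabilities in (0, 1)
    label    : 1 for a live face, 0 for a spoof attack
    """
    eps = 1e-7
    p = np.clip(pred_map, eps, 1.0 - eps)
    # the same binary target is broadcast to every pixel
    loss = -(label * np.log(p) + (1 - label) * np.log(1.0 - p))
    return float(loss.mean())

# a confident, correct prediction at every pixel yields a small loss
live_map = np.full((14, 14), 0.99)
print(pixelwise_binary_loss(live_map, 1))
```

Because every pixel carries the same binary target, the network cannot rely on one discriminative region alone and is pushed to learn liveness cues shared across the whole face.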

Key words: deep learning, face anti-spoofing, self-attention mechanism, pixel-wise supervision, lightweight model

• CLC number: TP391

Fig.1  Architecture of the EL-FAS model

Table 1  Architecture details of the EL-FAS model

Stage    Module  Input     Operation   t   c    s
Encoder  E1      3×224²    Conv3×3     —   32   2
                 32×112²   Bottleneck  1   16   1
         E2      16×112²   Bottleneck  6   24   2
                 24×56²    Bottleneck  6   24   1
                 24×56²    Bottleneck  6   32   2
         E3      32×28²    Bottleneck  6   32   1
                 32×28²    Bottleneck  6   32   1
                 32×28²    Bottleneck  6   64   2
                 64×14²    Bottleneck  6   64   1
                 64×14²    Bottleneck  6   64   1
         E4      64×14²    Bottleneck  6   64   1
                 64×14²    Bottleneck  6   96   1
                 96×14²    Bottleneck  6   96   1
                 96×14²    Bottleneck  6   96   1
Decoder          96×14²    ConvTrans   —   32   2
         D1      64×28²    Bottleneck  6   32   1
                 32×28²    Bottleneck  6   32   1
                 32×28²    ConvTrans   —   24   2
         D2      48×56²    Bottleneck  6   24   1
                 24×56²    Bottleneck  6   24   1
                 24×56²    ConvTrans   —   16   2
         D3      32×112²   Bottleneck  6   16   1
                 16×112²   Bottleneck  6   16   1
         D4      16×112²   ConvTrans   —   16   2
                 16×224²   Conv3×3     —   3    1
(t: expansion factor; c: output channels; s: stride)
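The Bottleneck rows in Table 1 follow the MobileNetV2-style inverted residual cited in [12]. Assuming the standard 1×1 expand / 3×3 depthwise / 1×1 project structure, and ignoring batch-norm and bias terms (and the fact that the t=1 variant in MobileNetV2 omits its expansion layer), the weight count of one block can be estimated with this illustrative helper:

```python
def bottleneck_params(c_in, c_out, t):
    """Weights in one MobileNetV2-style inverted-residual block,
    ignoring batch-norm and bias terms:
      1x1 expand (c_in -> t*c_in), 3x3 depthwise, 1x1 project.
    """
    hidden = t * c_in
    expand = c_in * hidden        # 1x1 pointwise expansion
    depthwise = hidden * 3 * 3    # 3x3 depthwise convolution
    project = hidden * c_out      # 1x1 pointwise projection
    return expand + depthwise + project

# e.g. the t=6 block that widens 32 channels to 64 in stage E3
print(bottleneck_params(32, 64, 6))  # 20160
```

Small per-block counts like this help explain how a full encoder-decoder can stay near the 1.34×10⁶-parameter total reported for EL-FAS.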

Fig.2  Architecture of the global spatial self-attention mechanism
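The paper's precise attention design is given only in Fig.2, which is not reproduced on this page. The sketch below illustrates the generic scaled dot-product form of spatial self-attention over flattened positions, which is what lets each pixel aggregate global context; all names, shapes, and projection matrices are illustrative assumptions:

```python
import numpy as np

def global_spatial_attention(x, wq, wk, wv):
    """Spatial self-attention over a flattened feature map: every
    position attends to every other position, so the output at each
    pixel mixes global context.

    x          : (H*W, C) flattened feature map
    wq, wk, wv : (C, d) query/key/value projection matrices
    """
    q, k, v = x @ wq, x @ wk, x @ wv
    scores = (q @ k.T) / np.sqrt(k.shape[1])      # (H*W, H*W) affinities
    scores -= scores.max(axis=1, keepdims=True)   # numerical stability
    attn = np.exp(scores)
    attn /= attn.sum(axis=1, keepdims=True)       # softmax over positions
    return attn @ v                               # (H*W, d) context features

rng = np.random.default_rng(0)
x = rng.standard_normal((14 * 14, 32))            # a 14x14, 32-channel map
wq, wk, wv = (rng.standard_normal((32, 16)) for _ in range(3))
out = global_spatial_attention(x, wq, wk, wv)
print(out.shape)  # (196, 16)
```

The (H·W)×(H·W) affinity matrix is what makes the receptive field global at every layer where the mechanism is inserted.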

Table 2  Description of the datasets

Dataset        Subjects  Videos  Attack devices                                Protocols
SiW            165       4 478   iPad, iPhone 7, Galaxy S8, Asus MB168B (RGB)  3
OULU-NPU       55        4 950   Dell 1905FP, MacBook (RGB)                    4
Replay-Attack  50        1 200   iPad, iPhone 3GS (RGB)                        1
CASIA-FASD     50        600     A4 paper, iPad (RGB)                          1

Table 3  Intra-dataset ablation study results

Model             R_APCE/%   R_BPCE/%   R_ACE/%
Single            4.2±9.3    4.6±6.8    4.4±4.9
Equal             3.3±4.7    5.4±6.8    4.4±3.2
Single+Attention  3.4±4.5    4.4±6.8    3.9±3.5
Equal+Attention   1.6±3.7    5.8±8.3    3.7±4.0

Table 4  Cross-dataset ablation study results

Model             R_APCE/%   R_BPCE/%   R_ACE/%
Single            3.3±3.7    2.5±2.5    2.9±2.7
Equal             3.0±4.0    2.5±2.2    2.8±2.0
Single+Attention  3.0±4.0    2.0±1.9    2.5±1.8
Equal+Attention   0.8±1.8    3.7±4.3    2.3±2.4

Table 5  Intra-dataset test results on the OULU-NPU dataset

Protocol  Method          R_APCE/%   R_BPCE/%   R_ACE/%
1 GRADIANT[18] 1.3 12.5 6.9
Auxiliary[17] 1.6 1.6 1.6
FaceDs[19] 1.2 1.7 1.5
STDN[20] 0.8 1.3 1.1
DSGT[5] 2.0 0 1.0
DeepPix[10] 0.8 0 0.4
EL-FAS 0 1.6 0.8
2 DeepPix[10] 11.4 0.6 6.0
FaceDs[19] 4.2 4.4 4.3
Auxiliary[17] 2.7 2.7 2.7
GRADIANT[18] 3.1 1.9 2.5
DSGT[5] 2.5 1.3 1.9
STDN[20] 2.3 1.6 1.9
EL-FAS 0.3 0.5 0.4
3 DeepPix[10] 11.7±19.6 10.6±14.1 11.1±9.4
GRADIANT[18] 2.6±3.9 5.0±5.3 3.8±2.4
Auxiliary[17] 2.6±3.9 5.0±5.3 3.8±2.4
FaceDs[19] 4.0±1.8 3.8±1.2 3.6±1.6
STDN[20] 1.6±1.6 4.0±5.4 2.8±3.3
DSGT[5] 3.2±2.0 2.2±1.4 2.7±0.6
EL-FAS 1.9±2.4 3.5±2.4 2.7±1.2
4 DeepPix[10] 36.7±29.7 13.3±16.8 25.0±12.7
GRADIANT[18] 5.0±4.5 15.0±7.1 10.0±5.0
Auxiliary[17] 9.3±5.6 10.4±6.0 9.5±6.0
FaceDs[19] 5.1±6.3 6.1±5.1 5.6±5.7
DSGT[5] 6.7±7.5 3.3±4.1 5.0±2.2
STDN[20] 2.3±3.6 5.2±5.4 3.8±4.2
EL-FAS 1.6±3.7 5.8±8.3 3.7±4.0

Table 6  Intra-dataset test results on the SiW dataset

Protocol  Method          R_APCE/%   R_BPCE/%   R_ACE/%
1 Auxiliary[17] 3.58 3.58 3.58
DSGT[5] 0.64 0.17 0.40
STDN[20] 0 0 0
EL-FAS 0 0 0
2 Auxiliary[17] 0.57±0.69 0.57±0.69 0.57±0.69
DSGT[5] 0 0.04±0.08 0.02±0.04
STDN[20] 0 0 0
EL-FAS 0.08±0.14 0 0.04±0.07
3 Auxiliary[17] 8.31±3.81 8.31±3.80 8.31±3.81
STDN[20] 8.33±3.33 7.50±3.33 7.90±3.30
DSGT[5] 2.63±3.72 2.92±3.42 2.78±3.57
EL-FAS 25.80±23.50 0 12.90±11.70

Table 7  Cross-dataset test results between CASIA-FASD and Replay-Attack

Method          R_HTE(C→R)/%   R_HTE(R→C)/%
CNN[3] 48.5 45.5
FaceDs[19] 28.5 41.1
Auxiliary[17] 27.6 28.4
EL-FAS 31.2 31.1

Table 8  Cross-dataset test results from SiW to OULU-NPU

Protocol  Method          R_APCE/%   R_BPCE/%   R_ACE/%
1 Auxiliary[17] 10.0
DSGT[5] 1.7 13.3 7.5
EL-FAS 3.3 4.6 3.9
2 Auxiliary[17] 14.1
DSGT[5] 9.7 14.2 11.9
EL-FAS 3.1 20.5 11.8
3 Auxiliary[17] 13.8±5.7
DSGT[5] 17.5±4.6 11.7±12.0 14.6±4.8
EL-FAS 3.9±2.8 10.8±3.8 7.4±2.5
4 Auxiliary[17] 10.0±8.8
DSGT[5] 0.8±1.9 10.0±11.6 5.4±5.7
EL-FAS 0.8±1.8 3.7±4.3 2.3±2.4

Table 9  Comparison of lightweight metrics

Method        Params       FLOPs
LSTM-CNN[4]   2.07×10⁶     4.77×10⁹
DSGT[5]       5.53×10⁶     773.40×10⁹
Depth[8]      9.25×10⁶     20.57×10⁹
BASN[9]       80.40×10⁶    274.13×10⁹
DeepPix[10]   3.20×10⁶     4.62×10⁹
EL-FAS        1.34×10⁶     0.79×10⁹
1 PATEL K, HAN H, JAIN A K. Secure face unlock: spoof detection on smartphones[J]. IEEE Transactions on Information Forensics and Security, 2016, 11(10): 2268-2283. doi: 10.1109/TIFS.2016.2578288
2 KOMULAINEN J, HADID A, PIETIKÄINEN M. Context based face anti-spoofing[C]//Proceedings of the 2013 IEEE Sixth International Conference on Biometrics: Theory, Applications and Systems (BTAS). Arlington, USA: IEEE, 2013: 6712690.
3 YANG J, LEI Z, LI S Z. Learn convolutional neural network for face anti-spoofing[EB/OL]. (2014-08-26)[2023-02-21]. https://arxiv.org/abs/1408.5601.
4 XU Z, LI S, DENG W. Learning temporal features using LSTM-CNN architecture for face anti-spoofing[C]//Proceedings of the 2015 3rd IAPR Asian Conference on Pattern Recognition(ACPR). Kuala Lumpur, Malaysia: IEEE, 2015: 141-145.
5 WANG Z, YU Z, ZHAO C, et al. Deep spatial gradient and temporal depth learning for face anti-spoofing[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Seattle, USA: IEEE, 2020: 5042-5051.
6 ZHANG S, WANG X, LIU A, et al. A dataset and benchmark for large-scale multi-modal face anti-spoofing[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Long Beach, USA: IEEE, 2019: 919-928.
7 WANG Z, WANG Q, DENG W, et al. Learning multi-granularity temporal characteristics for face anti-spoofing[J]. IEEE Transactions on Information Forensics and Security, 2022, 17: 1254-1269. doi: 10.1109/TIFS.2022.3158062
8 ATOUM Y, LIU Y, JOURABLOO A, et al. Face anti-spoofing using patch and depth-based CNNs[C]//Proceedings of the 2017 IEEE International Joint Conference on Biometrics (IJCB). Denver, USA: IEEE, 2017: 319-328.
9 KIM T, KIM Y H, KIM I, et al. BASN: enriching feature representation using bipartite auxiliary supervisions for face anti-spoofing[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops. Seoul, Korea: IEEE, 2019: 494-503.
10 GEORGE A, MARCEL S. Deep pixel-wise binary supervision for face presentation attack detection[C]// Proceedings of the 2019 International Conference on Biometrics (ICB). Crete, Greece: IEEE, 2019: 19352833.
11 RONNEBERGER O, FISCHER P, BROX T. U-Net: convolutional networks for biomedical image segmentation[C]//Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention. Munich, Germany: Springer, 2015: 234-241.
12 SANDLER M, HOWARD A, ZHU M, et al. MobileNetV2: inverted residuals and linear bottlenecks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Salt Lake City, USA: IEEE, 2018: 4510-4520.
13 VISIN F, KASTNER K, CHO K, et al. ReNet: a recurrent neural network based alternative to convolutional networks[EB/OL]. (2015-07-23)[2023-02-21]. https://arxiv.org/abs/1505.00393.
14 BOULKENAFET Z, KOMULAINEN J, LI L, et al. OULU-NPU: a mobile face presentation attack database with real-world variations[C]//Proceedings of the 2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017). Washington, USA: IEEE, 2017: 612-618.
15 CHINGOVSKA I, ANJOS A, MARCEL S. On the effectiveness of local binary patterns in face anti-spoofing[C]//2012 BIOSIG-Proceedings of the International Conference of Biometrics Special Interest Group (BIOSIG). Darmstadt, Germany: IEEE, 2012: 13029854.
16 ZHANG Z, YAN J, LIU S, et al. A face anti-spoofing database with diverse attacks[C]//Proceedings of the 2012 5th IAPR International Conference on Biometrics (ICB). New Delhi, India: IEEE, 2012: 26-31.
17 LIU Y, JOURABLOO A, LIU X. Learning deep models for face anti-spoofing: binary or auxiliary supervision[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Salt Lake City, USA: IEEE, 2018: 389-398.
18 BOULKENAFET Z, KOMULAINEN J, AKHTAR Z, et al. A competition on generalized software-based face presentation attack detection in mobile scenarios[C]//Proceedings of the 2017 IEEE International Joint Conference on Biometrics (IJCB). Denver, USA: IEEE, 2017: 688-696.
19 JOURABLOO A, LIU Y, LIU X. Face de-spoofing: anti-spoofing via noise modeling[C]//Proceedings of the European Conference on Computer Vision (ECCV). Munich, Germany: IEEE, 2018: 290-306.
20 LIU Y, STEHOUWER J, LIU X. On disentangling spoof trace for generic face anti-spoofing[C]//Proceedings of the European Conference on Computer Vision. Glasgow, UK: Springer, 2020: 406-422.