
to determine the probability (or confidence) of a label. In
most cases, it relies on the prior knowledge of the human
experts, which is a highly subjective and variable process.
As a result, the problem of learning from probabilistic
labels has not been extensively studied to date. Fortunately,
although not a probability by definition, $d_x^y$ still shares the
same constraints with probability, i.e., $d_x^y \in [0, 1]$ and
$\sum_y d_x^y = 1$. Thus, many theories and methods in statistics
can be applied to label distributions.
It is also worthwhile to distinguish description degree
from the concept membership used in fuzzy classification [42].
Membership is a truth value that may range between
completely true and completely false. It is designed to
handle the status of partial truth that often appears in the
nonnumeric linguistic variables. For example, the age 25
might have a membership of 0.7 to the linguistic category
“young” and 0.3 to “middle age.” But for a particular face,
its association with the chronological age 25 will be either
completely true or completely false. On the other hand,
description degree reflects the ambiguity of the class
description of the instance, i.e., one class label may only
partially describe the instance. For example, due to the
appearance similarity of the neighboring ages, both the
chronological age 25 and the neighboring ages 24 and
26 can be used to describe the appearance of a 25-year-old
face. For each of 24, 25, and 26, it is completely true that it
can be used to describe the face (in the sense of appearance).
Each age’s description degree indicates how much the age
contributes to the full class description of the face.
The prior label distribution assigned to a face image at
the chronological age $\alpha$ should satisfy the following two
properties: 1) the description degree of $\alpha$ is the highest in
the label distribution, which ensures the leading position
of the chronological age in the class description;
2) the description degree of the other ages decreases with the
increase of the distance away from $\alpha$, which makes the ages
closer to the chronological age contribute more to the class
description. While there are many possibilities, Fig. 3
shows two kinds of prior label distributions for the images
at the chronological age $\alpha$, i.e., the Gaussian distribution
and the triangle distribution. Note that the age $y$ is
regarded as a discrete class label in this paper, while both
the Gaussian and triangle distributions are defined by
continuous density functions $p(y)$. Directly letting $d_x^y = p(y)$
might induce $\sum_y d_x^y \neq 1$. Thus, a normalization process
$d_x^y = p(y) / \sum_y p(y)$ is required to ensure $\sum_y d_x^y = 1$.
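As a quick illustration of this normalization step, the sketch below samples both densities of Fig. 3 at integer ages and renormalizes the samples into valid label distributions. The width parameters (a Gaussian $\sigma = 2$ and a triangle half-width of 3) are illustrative choices, not values fixed by the text:

```python
import math

def normalize(density):
    """Turn sampled density values p(y) into description degrees
    d_x^y = p(y) / sum_y p(y), so the degrees sum to 1."""
    total = sum(density)
    return [p / total for p in density]

def gaussian_prior(ages, alpha, sigma=2.0):
    # Unnormalized Gaussian centered at the chronological age alpha.
    return normalize([math.exp(-((y - alpha) ** 2) / (2 * sigma ** 2)) for y in ages])

def triangle_prior(ages, alpha, half_width=3.0):
    # Triangle density peaking at alpha, reaching 0 at alpha +/- half_width.
    return normalize([max(0.0, 1.0 - abs(y - alpha) / half_width) for y in ages])

ages = list(range(20, 31))
for d in (gaussian_prior(ages, 25), triangle_prior(ages, 25)):
    assert abs(sum(d) - 1.0) < 1e-9      # sum-to-one constraint holds
    assert max(d) == d[ages.index(25)]   # highest degree at the chronological age
```

Either prior satisfies the two required properties: the peak sits at the chronological age, and the degrees fall off with distance from it.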
3 LEARNING FROM LABEL DISTRIBUTIONS
3.1 Problem Formulation
As mentioned before, many theories and methods from
statistics can be borrowed to deal with label distributions.
First of all, the description degree $d_x^y$ can be represented
in the form of a conditional probability, i.e., $d_x^y = P(y \mid x)$.
This can be interpreted as follows: given an instance $x$, the
probability of the presence of $y$ is equal to its description
degree. Then, the problem of label distribution learning can
be formulated as follows:
Let $\mathcal{X} = \mathbb{R}^q$ denote the input space and $\mathcal{Y} = \{y_1, y_2, \ldots, y_c\}$
denote the finite set of possible class labels. Given a training
set $S = \{(x_1, D_1), (x_2, D_2), \ldots, (x_n, D_n)\}$, where $x_i \in \mathcal{X}$ is an
instance and $D_i = \{d_{x_i}^{y_1}, d_{x_i}^{y_2}, \ldots, d_{x_i}^{y_c}\}$ is the label distribution
associated with $x_i$, the goal of label distribution learning is
to learn a conditional probability mass function $p(y \mid x)$ from
$S$, where $x \in \mathcal{X}$ and $y \in \mathcal{Y}$.
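To make the formulation concrete, a toy training set $S$ in this form might look as follows; the feature values and description degrees are fabricated purely for illustration, with $q = 4$ and $c = 3$:

```python
# A toy training set S = {(x_1, D_1), ..., (x_n, D_n)}: each x_i is a
# feature vector in R^q and each D_i a label distribution over c labels.
q, c = 4, 3
S = [
    ([0.1, 0.5, -0.2, 0.8], [0.7, 0.2, 0.1]),   # instance 1
    ([0.3, -0.1, 0.4, 0.0], [0.1, 0.6, 0.3]),   # instance 2
]
for x, D in S:
    assert len(x) == q and len(D) == c
    assert all(0.0 <= d <= 1.0 for d in D)      # each degree in [0, 1]
    assert abs(sum(D) - 1.0) < 1e-9             # degrees sum to 1
```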
For the problem of age estimation, suppose the same
shape of prior label distribution (e.g., Fig. 3) is assigned to
each face image; then the highest description degree for
each image will be the same, say, $p_{\max}$. Since the
description degree of the chronological age should always
be the highest in the label distribution, for a face image $x_\alpha$
at the chronological age $\alpha$, the label distribution learner
should output
$$p(\alpha \mid x_\alpha) = p_{\max}, \tag{1}$$
$$p(\alpha + \Delta \mid x_\alpha) = p_{\max} - \epsilon_\Delta, \tag{2}$$
where $\epsilon_\Delta \in [0, 1]$ is the description degree difference from
$p_{\max}$ when the age changes to a neighboring age $\alpha + \Delta$.
Similarly, for a face image $x_{\alpha+\Delta}$ at the chronological age
$\alpha + \Delta$:
$$p(\alpha + \Delta \mid x_{\alpha+\Delta}) = p_{\max}. \tag{3}$$
As mentioned before, the faces at close ages are quite
similar, i.e., $x_{\alpha+\Delta} \approx x_\alpha$; thus,
$$p(\alpha + \Delta \mid x_{\alpha+\Delta}) \approx p(\alpha + \Delta \mid x_\alpha). \tag{4}$$
So, $\epsilon_\Delta$ is a small positive number, which indicates that
$p(\alpha + \Delta \mid x_\alpha)$ is just a little smaller than $p(\alpha \mid x_\alpha)$. Note
that the above analysis does not depend on any particular
form of the prior label distribution except that it must
satisfy the two properties mentioned in Section 2. This
shows that, when applied to age estimation, label distribution
learning tends to learn the similarity among the
neighboring ages, no matter what the (reasonable) prior
label distribution might be.
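The size of this difference can be checked numerically for a concrete prior. Assuming a discretized Gaussian prior with an illustrative width of $\sigma = 2$ (a choice not fixed by the text), the degree drop between the chronological age and its immediate neighbor indeed comes out as a small positive number:

```python
import math

def discrete_gaussian(ages, alpha, sigma=2.0):
    # Gaussian density sampled at integer ages, renormalized to sum to 1.
    w = [math.exp(-((y - alpha) ** 2) / (2 * sigma ** 2)) for y in ages]
    s = sum(w)
    return {y: v / s for y, v in zip(ages, w)}

d = discrete_gaussian(range(18, 33), alpha=25)
p_max = d[25]             # highest description degree, at the chronological age
eps_1 = p_max - d[26]     # description degree drop for a neighbor one year away
assert 0 < eps_1 < 0.05   # small positive, as the argument above predicts
```

A wider prior (larger $\sigma$) makes the drop smaller, i.e., the neighboring ages are treated as even more similar.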
Suppose $p(y \mid x)$ is a parametric model $p(y \mid x; \theta)$, where
$\theta$ is the vector of the model parameters. Given the training
set $S$, the goal of label distribution learning is to find the $\theta$
that can generate a distribution similar to $D_i$ given the
instance $x_i$. If the Kullback-Leibler divergence is used to
measure the difference between two distributions, then the
best model parameter vector $\theta^*$ is determined by
$$\theta^* = \arg\min_\theta \sum_i \sum_j d_{x_i}^{y_j} \ln \frac{d_{x_i}^{y_j}}{p(y_j \mid x_i; \theta)} = \arg\max_\theta \sum_i \sum_j d_{x_i}^{y_j} \ln p(y_j \mid x_i; \theta). \tag{5}$$
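The equivalence of the two criteria in (5) holds because the entropy term $\sum_j d_{x_i}^{y_j} \ln d_{x_i}^{y_j}$ does not depend on $\theta$, so minimizing the KL divergence and maximizing the weighted log-likelihood select the same model. This can be verified on a toy example (the target and candidate distributions below are fabricated for illustration):

```python
import math

def kl_objective(D, P):
    # sum_j d_j * ln(d_j / p_j): KL divergence from model P to target D.
    return sum(d * math.log(d / p) for d, p in zip(D, P) if d > 0)

def loglik_objective(D, P):
    # sum_j d_j * ln p_j: the equivalent maximization criterion in (5).
    return sum(d * math.log(p) for d, p in zip(D, P) if d > 0)

D = [0.2, 0.5, 0.3]            # target label distribution
candidates = [                 # three candidate model outputs
    [0.1, 0.6, 0.3],
    [0.2, 0.5, 0.3],
    [0.4, 0.4, 0.2],
]
best_kl = min(candidates, key=lambda P: kl_objective(D, P))
best_ll = max(candidates, key=lambda P: loglik_objective(D, P))
assert best_kl == best_ll == [0.2, 0.5, 0.3]   # both criteria pick the same model
```

By Gibbs' inequality, both criteria are optimized exactly when the model output matches the target distribution, which is what the assertion confirms.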
It is interesting to examine the traditional learning
paradigms under the optimization criterion shown in (5).
For single-label learning (see Fig. 2a), $d_{x_i}^{y_j} = \mathrm{Kr}(y_j, y(x_i))$,
GENG ET AL.: FACIAL AGE ESTIMATION BY LEARNING FROM LABEL DISTRIBUTIONS 2403
Fig. 3. Typical label distributions for the image at the chronological
age $\alpha$: (a) Gaussian distribution and (b) triangle distribution.