使用VGG16模型和余弦相似度进行图像特征提取与比较

最新推荐文章于 2025-04-28 13:29:19 发布

蜡笔小新星

最新推荐文章于 2025-04-28 13:29:19 发布

阅读量735

点赞数 6

CC 4.0 BY-SA版权

文章标签： neo4j python 开发语言学习经验分享

本文链接：https://blog.csdn.net/m0_54490473/article/details/143894795

文章目录

在图像处理与计算机视觉领域，特征提取是一项至关重要的任务。它能帮助我们识别图像中的关键信息，并用于诸如图像分类、检索及识别等应用场景。本教程将指导你如何使用预训练的VGG16模型提取图像特征，并通过计算余弦相似度来比较这些特征。

准备工作

首先，你需要确保安装了以下Python库：

TensorFlow（含Keras API）
NumPy
scikit-learn
PIL（Python Imaging Library，即Pillow）

你可以使用pip命令来安装这些库（如果尚未安装）：

pip install tensorflow numpy scikit-learn pillow

步骤一：加载预训练的VGG16模型

VGG16是一个在大型图像数据集（如ImageNet）上预训练的深度卷积神经网络。在此教程中，我们将使用不包含全连接层的VGG16模型，以提取图像的高级特征。

from tensorflow.keras.applications.vgg16 import VGG16

# 加载预训练的VGG16模型，不包括全连接层，并启用平均池化
model = VGG16(weights='imagenet', include_top=False, pooling='avg')

步骤二：预处理图像

为了使VGG16模型能够处理你的图像，你需要将其调整为224x224像素大小，并进行归一化处理。以下是一个预处理图像的辅助函数：

from tensorflow.keras.preprocessing import image
from tensorflow.keras.applications.vgg16 import preprocess_input
import numpy as np

def preprocess_image(img_path):
    # 加载并调整图像大小
    img = image.load_img(img_path, target_size=(224, 224))
    # 将图像转换为NumPy数组
    img_array = image.img_to_array(img)
    # 扩展维度以匹配模型输入格式（批处理大小，高度，宽度，通道数）
    img_array = np.expand_dims(img_array, axis=0)
    # 进行预处理（归一化）
    img_array = preprocess_input(img_array)
    return img_array

步骤三：提取图像特征

接下来，我们将使用预处理后的图像和VGG16模型来提取特征。

def extract_features(img_path, model):
    # 预处理图像
    img_array = preprocess_image(img_path)
    # 使用模型进行预测，得到特征向量
    features = model.predict(img_array)
    # 将特征向量展平为一维数组
    return features.flatten()

步骤四：计算余弦相似度

余弦相似度是一种衡量两个向量方向相似度的指标。在此教程中，我们将使用scikit-learn库中的cosine_similarity函数来计算两个特征向量之间的余弦相似度。

from sklearn.metrics.pairwise import cosine_similarity

def calculate_cosine_similarity(features1, features2):
    # 计算两个特征向量之间的余弦相似度
    return cosine_similarity([features1], [features2])[0][0]

步骤五：示例使用

现在，你可以使用上述函数来比较两张图像的特征了。

# 指定图像路径
img_path1 = 'path_to_image1.jpg'
img_path2 = 'path_to_image2.jpg'

# 提取图像特征
features1 = extract_features(img_path1, model)
features2 = extract_features(img_path2, model)

# 计算并打印余弦相似度
similarity_score = calculate_cosine_similarity(features1, features2)
print(f"Cosine Similarity: {similarity_score}")