Model Provenance via Model DNA

Mu, Xin; Wang, Yu; Zhang, Yehong; Zhang, Jiaqi; Wang, Hui; Xiang, Yang; Yu, Yue

Computer Science > Machine Learning

arXiv:2308.02121 (cs)

[Submitted on 4 Aug 2023 (v1), last revised 18 Jul 2024 (this version, v3)]

Title:Model Provenance via Model DNA

Authors:Xin Mu, Yu Wang, Yehong Zhang, Jiaqi Zhang, Hui Wang, Yang Xiang, Yue Yu

View PDF HTML (experimental)

Abstract:Understanding the life cycle of the machine learning (ML) model is an intriguing area of research (e.g., understanding where the model comes from, how it is trained, and how it is used). This paper focuses on a novel problem within this field, namely Model Provenance (MP), which concerns the relationship between a target model and its pre-training model and aims to determine whether a source model serves as the provenance for a target model. This is an important problem that has significant implications for ensuring the security and intellectual property of machine learning models but has not received much attention in the literature. To fill in this gap, we introduce a novel concept of Model DNA which represents the unique characteristics of a machine learning model. We utilize a data-driven and model-driven representation learning method to encode the model's training data and input-output information as a compact and comprehensive representation (i.e., DNA) of the model. Using this model DNA, we develop an efficient framework for model provenance identification, which enables us to identify whether a source model is a pre-training model of a target model. We conduct evaluations on both computer vision and natural language processing tasks using various models, datasets, and scenarios to demonstrate the effectiveness of our approach in accurately identifying model provenance.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Software Engineering (cs.SE)
Cite as:	arXiv:2308.02121 [cs.LG]
	(or arXiv:2308.02121v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2308.02121

Submission history

From: Xin Mu [view email]
[v1] Fri, 4 Aug 2023 03:46:41 UTC (10,115 KB)
[v2] Wed, 17 Jul 2024 11:53:32 UTC (8,542 KB)
[v3] Thu, 18 Jul 2024 08:53:10 UTC (8,535 KB)

Computer Science > Machine Learning

Title:Model Provenance via Model DNA

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Model Provenance via Model DNA

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators