Cisco releases open source Model Provenance Kit for AI model tracking

Cisco has launched an open-source tool known as the Model Provenance Kit, aimed at improving the tracking of AI model lineage. This initiative addresses significant issues in model management, particularly concerning the risks of using unverified models from repositories like HuggingFace, which can lead to vulnerabilities, biases, and regulatory compliance challenges. The toolkit features two modes: “compare,” for identifying shared lineage between models, and “scan,” for matching models against Cisco’s fingerprint database. Cisco highlights that evolving practices in AI model development, such as fine-tuning and merging, complicate traditional tracking methods, necessitating the advanced signal-based fingerprinting approach implemented in this new tool. The Model Provenance Kit is now available on GitHub, providing organizations with a means to enhance their AI model governance.

Cisco: Cisco is a global technology company specializing in networking infrastructure, cybersecurity, and emerging AI defenses. It recently released the Model Provenance Kit, an open-source tool designed to fingerprint AI models and verify their origins amid rising supply chain risks. This initiative highlights Cisco’s focus on addressing vulnerabilities like model poisoning and unverified metadata in widely used AI repositories.
GitHub: GitHub is the leading web-based platform for version control and collaborative software development, hosting open-source projects worldwide. Cisco published the Model Provenance Kit repository on GitHub to enable community access and contributions to the AI model fingerprinting toolkit. The recent release has sparked quick updates and developer engagement.
Hugging Face: Hugging Face is an open platform for the machine learning community to discover, share, and collaborate on models, datasets, and applications. The news references it as a primary repository where organizations source AI models often lacking consistent provenance tracking. Cisco hosts its base model fingerprint dataset there to support the Model Provenance Kit’s scanning capabilities.
Model Provenance Kit: Model Provenance Kit is a Python toolkit and command-line interface from Cisco for detecting shared lineage among AI models through fingerprinting techniques. It features compare mode for pairwise analysis and scan mode against a precomputed database, tackling issues like inherited vulnerabilities and obscured model histories. Released open-source days ago, it provides an evidence-based approach to AI supply chain integrity.

`json
{
“Tool Modes”: “The kit offers compare mode to check shared lineage between two models and scan mode to match against Cisco’s fingerprint database.”,
“Addressed Risks”: “It mitigates threats from poisoned models, training biases, supply chain compromises, and regulatory documentation requirements for AI deployments.”,
“Provenance Evolution”: “Fine-tuning, distillation, merging, and repackaging make traditional model tracking insufficient, necessitating advanced signal-based fingerprinting.”
}
`