DataShield
Protect Your GitHub Data

Protect your GitHub code from unauthorized AI training

What's Happening to Your Code

Public GitHub repositories are being scraped into datasets used to train AI models, often without the authors' consent

The Stack

6TB of permissively licensed code from GitHub

The Pile

800GB of diverse text data including code repositories

CodeParrot

Code datasets used to train AI coding assistants

Developers deserve the right to opt out or control how their code is used

How It Works

Automated protection with decentralized proof of authorship

Connect

Link your Flow wallet and GitHub account

Register Repo

Submit your GitHub repo URL

Mint Proof

Generate a Flow NFT proving authorship

AI Monitoring

Automated scanning of public datasets

Submit Claim

File opt-out or license assertion
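As a rough illustration of the monitoring and claim steps above, the sketch below compares registered fingerprints against files found in a dataset and builds a claim record when any match. Everything named here (`fingerprint`, `Claim`, `scan_dataset`, `build_claim`, the example URL and dataset name) is hypothetical and not DataShield's actual implementation. It also assumes exact SHA-256 matching, which only detects byte-identical files.

```python
import hashlib
from dataclasses import dataclass
from typing import Iterable, Optional, Set, Tuple


def fingerprint(content: bytes) -> str:
    """Return the SHA-256 hex digest of a file's bytes (assumed fingerprint scheme)."""
    return hashlib.sha256(content).hexdigest()


@dataclass(frozen=True)
class Claim:
    """Hypothetical claim record; field names are illustrative only."""
    repo_url: str
    dataset: str
    claim_type: str  # "opt-out" or "license-assertion"
    matched: Tuple[str, ...]


def scan_dataset(registered: Set[str], dataset_files: Iterable[bytes]) -> Tuple[str, ...]:
    """Return the registered fingerprints that also occur in the dataset, sorted."""
    seen = {fingerprint(content) for content in dataset_files}
    return tuple(sorted(seen & registered))


def build_claim(
    repo_url: str,
    dataset: str,
    registered: Set[str],
    dataset_files: Iterable[bytes],
    claim_type: str = "opt-out",
) -> Optional[Claim]:
    """Build a claim only if at least one registered file appears in the dataset."""
    matched = scan_dataset(registered, dataset_files)
    if not matched:
        return None
    return Claim(repo_url, dataset, claim_type, matched)


if __name__ == "__main__":
    registered = {fingerprint(b"print('hi')\n")}
    claim = build_claim(
        "https://github.com/example/repo",  # placeholder URL
        "example-dataset",                  # placeholder dataset name
        registered,
        [b"unrelated file", b"print('hi')\n"],
    )
    print(claim)
```

A production scanner would likely need fuzzy or normalized matching as well, since reformatted or partially copied files produce different digests.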

Privacy First

Only hashed fingerprints are stored — your source code never leaves your control
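One way such fingerprints could be produced is sketched below: each file is hashed locally with SHA-256 and only the digests are collected for upload. The function names (`file_fingerprint`, `repo_fingerprints`, `upload_payload`) and the choice to skip the `.git` directory are assumptions made for illustration, not a description of how DataShield computes them.

```python
import hashlib
from pathlib import Path
from typing import Dict, List


def file_fingerprint(path: Path) -> str:
    """Hash a file in chunks so large files are never fully loaded into memory."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def repo_fingerprints(root: Path) -> Dict[str, str]:
    """Map each file's relative path to its digest, skipping Git metadata."""
    result: Dict[str, str] = {}
    for path in sorted(root.rglob("*")):
        relative = path.relative_to(root)
        if path.is_file() and ".git" not in relative.parts:
            result[relative.as_posix()] = file_fingerprint(path)
    return result


def upload_payload(fingerprints: Dict[str, str]) -> List[str]:
    """Keep only the digests: no file names or contents leave the machine."""
    return sorted(set(fingerprints.values()))


if __name__ == "__main__":
    for digest in upload_payload(repo_fingerprints(Path("."))):
        print(digest)
```

Because SHA-256 is one-way, the digests alone do not reveal the source text, though anyone who already holds an identical file can confirm a match.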

Automated

AI agents do the monitoring work so you don't have to manually check datasets

Tamper-Proof

Flow provides authorship identity; IPFS provides decentralized, content-addressed claim storage
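The tamper-evidence property comes from content addressing: a record's identifier is derived from its bytes, so any change yields a different identifier. The sketch below imitates this with a plain SHA-256 digest over canonical JSON. This is a simplification: real IPFS identifiers (CIDs) use multihash and CID encoding, and the `content_id` and `verify` helpers and the claim fields are hypothetical.

```python
import hashlib
import json
from typing import Any, Dict


def content_id(record: Dict[str, Any]) -> str:
    """Digest of the record's canonical JSON form (stand-in for an IPFS CID)."""
    canonical = json.dumps(record, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def verify(record: Dict[str, Any], expected_id: str) -> bool:
    """True only if the record still hashes to the identifier stored earlier."""
    return content_id(record) == expected_id


if __name__ == "__main__":
    claim = {
        "repo": "https://github.com/example/repo",  # placeholder URL
        "type": "opt-out",
        "dataset": "example-dataset",               # placeholder name
    }
    stored_id = content_id(claim)

    tampered = dict(claim, type="license-assertion")
    print(verify(claim, stored_id))     # original record still matches
    print(verify(tampered, stored_id))  # edited record no longer matches
```

Serializing with sorted keys keeps the identifier stable regardless of the order in which fields were written.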

Technology Stack

Built on the Flow blockchain and IPFS decentralized storage

Flow Blockchain

  • Secure identity management
  • Smart contract automation
  • NFT proof of authorship
  • Low transaction costs

IPFS Storage

  • Tamper-evident claim storage
  • Decentralized data hosting
  • Persistent record keeping (as long as content stays pinned)
  • Global accessibility

Community Impact

12,847

Protected Repositories

3,429

Claims Submitted

8,756

NFTs Minted