Putting Up the 'Do Not Enter' Sign
What is it?
AI companies and dataset builders run crawlers (like OpenAI's GPTBot or Common Crawl's CCBot) that scan the public web to collect data for training models. By adding a few lines to your website's robots.txt file, you formally tell these bots: 'You are not allowed to read or use my content.'
Why is it important?
If you have unique product descriptions, proprietary blog posts, or pricing data, you might not want AI companies using your hard work to train their models (which might eventually help your competitors). It's a basic layer of data sovereignty.
How to do it in Shopify:
- Access the File: Shopify allows you to customize robots.txt via the robots.txt.liquid file in your Theme code editor (or via a dedicated app).
- Add the Block: Insert code like:
    User-agent: GPTBot
    Disallow: /

    User-agent: CCBot
    Disallow: /
- Verify: Use the robots.txt report in Google Search Console (it replaced the older robots.txt tester) to ensure you didn't accidentally block Googlebot. Blocking Googlebot stops Google from crawling your store, which badly damages your SEO.
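The verification step can also be sketched locally before you publish. The example below is a minimal sketch, not a Shopify or Google tool: it uses Python's standard urllib.robotparser module to parse robots.txt text and report which crawlers may fetch a given path. The ROBOTS_TXT contents, the check_access helper, and the /products/example path are all hypothetical and only for illustration; the catch-all group stands in for whatever default rules your store already serves.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt body: the two AI-crawler blocks from the step above,
# plus a catch-all group standing in for the store's existing default rules.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: CCBot
Disallow: /

User-agent: *
Disallow: /cart
"""


def check_access(robots_txt, agents, path="/"):
    """Return {agent: True if allowed to fetch path} for the given robots.txt text."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is made here.
    parser.parse(robots_txt.splitlines())
    return {agent: parser.can_fetch(agent, path) for agent in agents}


if __name__ == "__main__":
    results = check_access(
        ROBOTS_TXT,
        ["GPTBot", "CCBot", "Googlebot"],
        "/products/example",  # hypothetical product URL path
    )
    for agent, allowed in results.items():
        print(f"{agent}: {'allowed' if allowed else 'blocked'}")
```

With these sample rules, GPTBot and CCBot are reported as blocked while Googlebot remains allowed. To check a live file instead of a string, RobotFileParser also offers set_url() and read(), pointed at your own store's /robots.txt address. Python's parser may interpret edge cases differently from Google's, so treat this as a quick sanity check alongside Search Console rather than a replacement for it.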
Reality Check: This relies on the 'honor system.' Reputable crawlers (OpenAI states that GPTBot follows robots.txt) respect it. Malicious scrapers ignore it completely.