appfoundry.be
robots.txt

Robots Exclusion Standard data for appfoundry.be

Resource Scan

Scan Details

Site Domain appfoundry.be
Base Domain appfoundry.be
Scan Status Ok
Last Scan 2025-11-30T19:47:44+00:00
Next Scan 2025-12-30T19:47:44+00:00

Last Scan

Scanned 2025-11-30T19:47:44+00:00
URL https://appfoundry.be/robots.txt
Redirect https://www.appfoundry.be/robots.txt
Redirect Domain www.appfoundry.be
Redirect Base appfoundry.be
Domain IPs 216.150.1.1
Redirect IPs 216.150.1.129, 216.150.16.129
Response IP 216.150.16.1
Found Yes
Hash 9f333e6ff32e048af7236e1b151cbe60d5398e9e0556d625f06b16217435b501
SimHash 4d1e98a0e474
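
A content hash like the one above is typically used to detect whether the file changed between scans. The report does not name the algorithm; the 64 hex digits are consistent with SHA-256, so the sketch below assumes that. The function names `body_digest` and `has_changed` are hypothetical, chosen only for illustration.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the hex digest of a raw robots.txt response body.

    Assumption: SHA-256 (the report does not say which hash it uses).
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, previous_digest: str) -> bool:
    """Compare a freshly fetched body against a digest from an earlier scan."""
    return body_digest(body) != previous_digest.lower()
```

Comparing `body_digest(new_body)` with the Hash value recorded here would only be meaningful if the scanner hashes the unmodified response bytes with the same algorithm, which is not confirmed by the report.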

Groups

*

Rule Path
Allow /

gptbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 2

claude-web

Rule Path
Allow /

Other Records

Field Value
crawl-delay 2

google-extended

Rule Path
Allow /

Other Records

Field Value
crawl-delay 2

chatgpt-user

Rule Path
Allow /

Other Records

Field Value
crawl-delay 2

ccbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 2

anthropic-ai

Rule Path
Allow /

Other Records

Field Value
crawl-delay 2

ai2-crawler

Rule Path
Allow /

Other Records

Field Value
crawl-delay 2

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /
Disallow /admin/
Disallow /_next/
Disallow /api/
Disallow /.well-known/
Disallow /test/
Disallow /dev/
Disallow /staging/
Disallow /*.json$
Disallow /*.xml$
Disallow /*.txt$
Disallow /*.log$
Allow /sitemap.xml
Allow /robots.txt
Allow /ai.txt

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://appfoundry.be/sitemap.xml
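
The groups above can be checked programmatically. The following is a minimal sketch using Python's standard `urllib.robotparser`. The `EXCERPT` text is a hand-written reconstruction of the `*`, `gptbot`, and `ahrefsbot` groups as listed in this report, not the site's actual file, and `load_rules` is a hypothetical helper name.

```python
from urllib.robotparser import RobotFileParser

# Assumption: reconstructed from three of the groups listed in the report;
# the real robots.txt may differ in order, casing, and comments.
EXCERPT = """\
User-agent: *
Allow: /

User-agent: GPTBot
Allow: /
Crawl-delay: 2

User-agent: AhrefsBot
Disallow: /
"""


def load_rules(text: str) -> RobotFileParser:
    """Parse robots.txt text into a RobotFileParser without any network access."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


rules = load_rules(EXCERPT)
print(rules.can_fetch("GPTBot", "https://www.appfoundry.be/"))     # allowed by its own group
print(rules.can_fetch("AhrefsBot", "https://www.appfoundry.be/"))  # blocked by Disallow /
print(rules.crawl_delay("GPTBot"))                                 # delay from the gptbot group
```

Note that `urllib.robotparser` matches rules by plain path prefix and does not interpret wildcard patterns such as `/*.json$`, so it would not be a faithful evaluator for the wildcard rules in the semrushbot group.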

Comments

  • Sitemap
  • AI Training & Content Licensing
  • See also: https://appfoundry.be/ai.txt for detailed AI usage policies
  • OpenAI GPT Crawler
  • Anthropic Claude Crawler
  • Google Bard/Gemini
  • ChatGPT Plugin/Browser
  • Common AI Training Crawlers
  • Block known scrapers that don't respect AI policies
  • Crawl-delay for respectful crawling
  • Block admin and internal paths
  • Block development and testing paths
  • Block file types that shouldn't be indexed
  • Allow important files