gu.djav.org
robots.txt

Robots Exclusion Standard data for gu.djav.org

Resource Scan

Scan Details

Site Domain gu.djav.org
Base Domain djav.org
Scan Status Ok
Last Scan2025-10-30T14:30:35+00:00
Next Scan 2025-11-29T14:30:35+00:00

Last Scan

Scanned2025-10-30T14:30:35+00:00
URL https://gu.djav.org/robots.txt
Domain IPs 104.21.81.35, 172.67.137.211, 2606:4700:3034::6815:5123, 2606:4700:3036::ac43:89d3
Response IP 172.67.137.211
Found Yes
Hash fa17e9ace0326062fbc48cd0c4b5f78bae894f439e153289ad572fe00be0910e
SimHash f11a8904e213

Groups

*

Rule Path
Disallow /admin/
Allow /includes/videojs/
Disallow /controllers/
Disallow /api/
Disallow /content/
Disallow /csv_photos/
Disallow /ftp_content/
Disallow /ftp_photos/
Disallow /temp_users_uploads/
Disallow /cache/
Disallow /action.php?action=*
Disallow /filter-content/
Disallow /filters/
Disallow /filter/

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

tineye-bot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

pimeyesbot

Rule Path
Disallow /

yandeximages

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

amazonadbot

Rule Path
Disallow /

amazonproductdiscoverybot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://gu.djav.org/sitemap.xml

Comments

  • Disallow: /includes/
  • Disallow: /templates/default_tube2019/template.ajax_comments.php
  • Google’s AI training bot