cuutruyen.site
robots.txt

Robots Exclusion Standard data for cuutruyen.site

Resource Scan

Scan Details

Site Domain cuutruyen.site
Base Domain cuutruyen.site
Scan Status Ok
Last Scan2026-03-09T00:23:11+00:00
Next Scan 2026-04-08T00:23:11+00:00

Last Scan

Scanned2026-03-09T00:23:11+00:00
URL https://cuutruyen.site/robots.txt
Redirect https://cuutruyen.cc/robots.txt
Redirect Domain cuutruyen.cc
Redirect Base cuutruyen.cc
Domain IPs 104.21.88.105, 172.67.176.227, 2606:4700:3033::ac43:b0e3, 2606:4700:3035::6815:5869
Redirect IPs 104.21.85.39, 172.67.201.231, 2606:4700:3037::6815:5527, 2606:4700:3037::ac43:c9e7
Response IP 172.67.201.231
Found Yes
Hash 88113f5a299a7444f4ede402e524730015bf127d720acdc4fc55bb45a3f996c4
SimHash 6c605f95e457

Groups

*

Rule Path
Allow /
Disallow /admin/
Disallow /api/
Disallow /login
Disallow /register
Disallow /password/
Allow /css/
Allow /js/
Allow /img/
Allow /fonts/

googlebot

Rule Path
Allow /
Disallow /admin/
Disallow /api/

googlebot-image

Rule Path
Allow /

Other Records

Field Value
sitemap https://cuutruyen.site/sitemap.xml

Comments

  • Allow all bots to crawl
  • Disallow admin and API endpoints
  • Allow crawling of static assets
  • Googlebot specific rules
  • Googlebot Image
  • Sitemap location