sauge.ai
robots.txt

Robots Exclusion Standard data for sauge.ai

Resource Scan

Scan Details

Site Domain sauge.ai
Base Domain sauge.ai
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-12-23T04:44:22+00:00
Next Scan 2025-12-30T04:44:22+00:00

Last Successful Scan

Scanned 2025-12-15T02:26:05+00:00
URL https://sauge.ai/robots.txt
Domain IPs 191.101.228.196, 2a02:4780:15:571f:e446:6217:8917:5c40, 2a02:4780:38:1595:684d:ca24:ae23:b670, 93.127.187.212
Response IP 93.127.196.19
Found Yes
Hash 0716be650b1f09d6b274baaf14542d4f3cb5c120f94594e598599320a14cb21f
SimHash 738059542980

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /wp-login.php
Disallow /wp-register.php
Allow /wp-json/sauge-jobs/v1/jobs
Allow /wp-json/
Allow /wp-content/
Allow /wp-includes/
Allow /jobs/
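
As a rough illustration of how a crawler might evaluate the rules above, the following Python sketch applies longest-match precedence as described in RFC 9309: the most specific matching path wins, Allow wins ties, and unmatched paths are allowed. The names RULES and is_allowed are hypothetical and not part of the scan data. Only plain prefix matching is handled, since none of the listed paths use wildcards.

```python
# Rules copied from the "*" group above, as (directive, path) pairs.
RULES = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "/wp-login.php"),
    ("disallow", "/wp-register.php"),
    ("allow", "/wp-json/sauge-jobs/v1/jobs"),
    ("allow", "/wp-json/"),
    ("allow", "/wp-content/"),
    ("allow", "/wp-includes/"),
    ("allow", "/jobs/"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Hypothetical helper: the longest matching prefix wins, an Allow rule
    wins a tie in length, and a path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for directive, prefix in rules:
        if not path.startswith(prefix):
            continue
        is_allow = directive == "allow"
        if len(prefix) > best_len or (len(prefix) == best_len and is_allow):
            best_len = len(prefix)
            allowed = is_allow
    return allowed


if __name__ == "__main__":
    for p in ["/wp-admin/", "/wp-admin/admin-ajax.php", "/wp-login.php", "/jobs/", "/"]:
        print(p, "allowed" if is_allowed(p) else "blocked")
```

Under these assumptions, /wp-admin/admin-ajax.php is allowed despite the broader Disallow on /wp-admin/, because the Allow path is the longer match. A parser that uses first-match ordering instead could reach a different result for that path.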

Other Records

Field Value
sitemap https://sauge.ai/sauge-jobs-sitemap.xml
sitemap https://sauge.ai/page-sitemap.xml

Comments

  • robots.txt for https://sauge.ai
  • Last updated: March 2025
  • Only block admin and login areas
  • Explicitly allow jobs API
  • Allow everything else
  • Explicitly allow jobs
  • Sitemap declarations