dev.semagi.com
robots.txt

Robots Exclusion Standard data for dev.semagi.com

Resource Scan

Scan Details

Site Domain dev.semagi.com
Base Domain semagi.com
Scan Status Ok
Last Scan2025-09-06T09:46:27+00:00
Next Scan 2025-10-06T09:46:27+00:00

Last Scan

Scanned2025-09-06T09:46:27+00:00
URL https://dev.semagi.com/robots.txt
Domain IPs 35.213.139.179
Response IP 35.213.139.179
Found Yes
Hash 4c29926f41aa53e67f85ef9fe5fd0f3360266600640f566f8f3250e39f035976
SimHash 0c1fdf202f97

Groups

*

Rule Path
Allow /
Disallow /profile/
Disallow /recents/
Disallow /purchases/
Disallow /login/
Disallow /register/
Disallow /credits/success/
Disallow /credits/cancel/
Disallow /api/auth/
Disallow /api/user/
Disallow /api/task/
Disallow /api/payment/
Disallow /_next/
Disallow /out/
Disallow /.env*
Disallow /temp/
Disallow /logs/

googlebot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 1

bingbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 1

baiduspider

Rule Path
Allow /
Allow /zh/

Other Records

Field Value
crawl-delay 2

badbot
semrushbot
ahrefsbot
mj12bot
dotbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://dev.semagi.com/sitemap.xml