smutba.se
robots.txt

Robots Exclusion Standard data for smutba.se

Resource Scan

Scan Details

Site Domain smutba.se
Base Domain smutba.se
Scan Status Ok
Last Scan2025-04-19T12:23:58+00:00
Next Scan 2025-04-26T12:23:58+00:00

Last Scan

Scanned2025-04-19T12:23:58+00:00
URL https://smutba.se/robots.txt
Domain IPs 2a01:7c8:e001:bd::cedd, 89.41.170.230
Response IP 89.41.170.230
Found Yes
Hash 8b90fceffeab778dc85384926e84728fa8afda6d265c52e28cfc2012233571ad
SimHash 401889f3a7b4

Groups

*

Rule Path
Allow /
Disallow /project/file/download/
Disallow /serve_file/
Disallow /media/cache/
Disallow /project/delete/
Disallow /tutorials/new/
Disallow /comments/
Disallow /static/CACHE/
Disallow /emoji/
Disallow /project/create/
Disallow /project/file/download

Other Records

Field Value
crawl-delay 2

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

friendlycrawler

Rule Path
Disallow /

googleother

Rule Path
Disallow /

img2dataset

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

peer39_crawler

Rule Path
Disallow /

peer39_crawler/1.0

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

youbot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

Other Records

Field Value
sitemap http://smutba.se/sitemap.xml

Warnings

  • `host` is not a known field.