sfmlab.com
robots.txt

Robots Exclusion Standard data for sfmlab.com

Resource Scan

Scan Details

Site Domain sfmlab.com
Base Domain sfmlab.com
Scan Status Ok
Last Scan 2025-05-02T23:51:34+00:00
Next Scan 2025-05-09T23:51:34+00:00

Last Scan

Scanned 2025-05-02T23:51:34+00:00
URL https://sfmlab.com/robots.txt
Domain IPs 2a01:7c8:e001:bd::cedd, 89.41.170.230
Response IP 89.41.170.230
Found Yes
Hash 0925e6863955a5c50f8457b1fea910c6241530d01d783e749f7ccdadc61ece16
SimHash 601889f2a734

Groups

*

Rule Path
Allow /
Disallow /project/file/download/
Disallow /serve_file/
Disallow /media/cache/
Disallow /project/delete/
Disallow /tutorials/new/
Disallow /comments/
Disallow /static/CACHE/
Disallow /emoji/
Disallow /project/create/
Disallow /project/file/download

Other Records

Field Value
crawl-delay 2
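
How the `*` group above resolves for a given path can be sketched as follows. This is a minimal illustration, assuming the longest-match evaluation described in RFC 9309 (the most specific matching rule wins, and Allow wins a tie); individual crawlers may evaluate rules differently. The names `STAR_RULES` and `is_allowed` are hypothetical and not part of the scan data. Plain prefix matching is used because none of the listed paths contain wildcards.

```python
# Rules copied from the "*" group above, as (directive, path) pairs.
STAR_RULES = [
    ("allow", "/"),
    ("disallow", "/project/file/download/"),
    ("disallow", "/serve_file/"),
    ("disallow", "/media/cache/"),
    ("disallow", "/project/delete/"),
    ("disallow", "/tutorials/new/"),
    ("disallow", "/comments/"),
    ("disallow", "/static/CACHE/"),
    ("disallow", "/emoji/"),
    ("disallow", "/project/create/"),
    ("disallow", "/project/file/download"),
]


def is_allowed(path, rules):
    """Return True if `path` may be fetched under `rules`.

    Assumption: longest matching prefix wins, Allow wins ties,
    and a path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for directive, prefix in rules:
        if not path.startswith(prefix):
            continue
        n = len(prefix)
        if n > best_len or (n == best_len and directive == "allow"):
            best_len = n
            allowed = directive == "allow"
    return allowed


if __name__ == "__main__":
    for p in ["/", "/models/", "/comments/1", "/project/file/download/123"]:
        print(p, is_allowed(p, STAR_RULES))
```

Under these assumptions the blanket `Allow /` only applies where no longer Disallow prefix matches. The `crawl-delay 2` record is not modelled here; it is a non-standard field that some crawlers honour and others ignore.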

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

friendlycrawler

Rule Path
Disallow /

googleother

Rule Path
Disallow /

img2dataset

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

peer39_crawler

Rule Path
Disallow /

peer39_crawler/1.0

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

youbot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /
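
The named groups above each carry a single `Disallow /`, so a crawler that matches one of them is excluded from the whole site and does not fall back to the `*` group. A rough sketch of that group selection follows, assuming a simplified case-insensitive exact match on the crawler's product token (real crawlers may match more loosely). `FULLY_BLOCKED`, `select_group` and `is_fully_blocked` are hypothetical names used only for illustration.

```python
# Group names copied from the listing above; each has one rule, "Disallow /".
FULLY_BLOCKED = {
    "gptbot", "chatgpt-user", "google-extended", "ccbot", "anthropic-ai",
    "amazonbot", "applebot-extended", "bytespider", "claudebot", "claude-web",
    "diffbot", "facebookbot", "friendlycrawler", "googleother", "img2dataset",
    "omgili", "omgilibot", "peer39_crawler", "peer39_crawler/1.0",
    "perplexitybot", "youbot", "cohere-ai",
}


def select_group(product_token):
    """Pick the group that applies to a crawler.

    Assumption: exact, case-insensitive match on the product token;
    anything not listed falls back to the "*" group.
    """
    token = product_token.lower()
    return token if token in FULLY_BLOCKED else "*"


def is_fully_blocked(product_token):
    """True when the crawler matches a group whose only rule is Disallow /."""
    return select_group(product_token) != "*"


if __name__ == "__main__":
    for agent in ["GPTBot", "ClaudeBot", "Googlebot"]:
        print(agent, select_group(agent), is_fully_blocked(agent))
```

A crawler that is not listed, such as a general search engine bot, would be governed by the `*` group and its path-level rules instead.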

Other Records

Field Value
sitemap http://sfmlab.com/sitemap.xml

Warnings

  • `host` is not a known field.