swinsian.com
robots.txt

Robots Exclusion Standard data for swinsian.com

Resource Scan

Scan Details

Site Domain swinsian.com
Base Domain swinsian.com
Scan Status Ok
Last Scan2025-11-12T20:53:33+00:00
Next Scan 2025-12-12T20:53:33+00:00

Last Scan

Scanned2025-11-12T20:53:33+00:00
URL https://swinsian.com/robots.txt
Domain IPs 2001:8d8:100f:f000::200, 217.160.0.207
Response IP 217.160.0.207
Found Yes
Hash e6f90f9c0e682f95317f81b0d648a63ca55d840e866f417fe24c0fbc827689a9
SimHash 76040b11c284

Groups

*

Rule Path
Disallow /support/sendfeedback.php
Disallow /crashreport.php
Disallow /thanks.html
Disallow /sparkle/
Disallow /sparkle_beta/
Disallow /download/
Disallow /download-thanks.html

gptbot
claudebot
claude-user
claude-searchbot
ccbot
google-extended
applebot-extended
facebookbot
meta-externalagent
meta-externalfetcher
diffbot
perplexitybot
perplexity‑user
omgili
omgilibot
webzio-extended
imagesiftbot
bytespider
tiktokspider
amazonbot
youbot
semrushbot-ocob
petalbot
velenpublicwebcrawler
turnitinbot
timpibot
oai-searchbot
icc-crawler
ai2bot
ai2bot-dolma
dataforseobot
awariobot
awariosmartbot
awariorssbot
google-cloudvertexbot
pangubot
kangaroo bot
sentibot
img2dataset
meltwater
seekr
peer39_crawler
cohere-ai
cohere-training-data-crawler
duckassistbot
scrapy
cotoyogi
aihitbot
factset_spyderbot
firecrawlagent

Rule Path
Disallow /

*

Rule Path
Allow /

Other Records

Field Value
sitemap https://swinsian.com/sitemapindex.xml

Warnings

  • `content-usage` is not a known field.
  • `disallowaitraining` is not a known field.