swinsian.com
robots.txt

Robots Exclusion Standard data for swinsian.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	swinsian.com
Base Domain	swinsian.com
Scan Status	Ok
Last Scan	2025-11-12T20:53:33+00:00
Next Scan	2025-12-12T20:53:33+00:00

Last Scan

Scanned	2025-11-12T20:53:33+00:00
URL	https://swinsian.com/robots.txt
Domain IPs	2001:8d8:100f:f000::200, 217.160.0.207
Response IP	217.160.0.207
Found	Yes
Hash	e6f90f9c0e682f95317f81b0d648a63ca55d840e866f417fe24c0fbc827689a9
SimHash	76040b11c284

Groups

*

Rule	Path
Disallow	/support/sendfeedback.php
Disallow	/crashreport.php
Disallow	/thanks.html
Disallow	/sparkle/
Disallow	/sparkle_beta/
Disallow	/download/
Disallow	/download-thanks.html

Rule

Path

Disallow

/support/sendfeedback.php

Disallow

/crashreport.php

Disallow

/thanks.html

Disallow

/sparkle/

Disallow

/sparkle_beta/

Disallow

/download/

Disallow

/download-thanks.html

gptbot
claudebot
claude-user
claude-searchbot
ccbot
google-extended
applebot-extended
facebookbot
meta-externalagent
meta-externalfetcher
diffbot
perplexitybot
perplexity‑user
omgili
omgilibot
webzio-extended
imagesiftbot
bytespider
tiktokspider
amazonbot
youbot
semrushbot-ocob
petalbot
velenpublicwebcrawler
turnitinbot
timpibot
oai-searchbot
icc-crawler
ai2bot
ai2bot-dolma
dataforseobot
awariobot
awariosmartbot
awariorssbot
google-cloudvertexbot
pangubot
kangaroo bot
sentibot
img2dataset
meltwater
seekr
peer39_crawler
cohere-ai
cohere-training-data-crawler
duckassistbot
scrapy
cotoyogi
aihitbot
factset_spyderbot
firecrawlagent

Rule	Path
Disallow	/

Rule

Path

Disallow

/

*

Rule	Path
Allow	/

Rule

Path

Allow

/

Back to top

Other Records

Field	Value
sitemap	https://swinsian.com/sitemapindex.xml

Field

Value

sitemap

https://swinsian.com/sitemapindex.xml

Back to top

Warnings

`content-usage` is not a known field.
`disallowaitraining` is not a known field.

Back to top

swinsian.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

*

Other Records

Warnings

swinsian.com
robots.txt