arsbonus.com
robots.txt

Robots Exclusion Standard data for arsbonus.com

Resource Scan

Scan Details

Site Domain	arsbonus.com
Base Domain	arsbonus.com
Scan Status	Ok
Last Scan	2026-02-07T00:34:06+00:00
Next Scan	2026-03-09T00:34:06+00:00
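
The two timestamps above imply a rescan interval, which can be checked with a short sketch. The values are copied from the table; the helper `following_scan` is hypothetical and assumes the interval stays fixed, which the report does not state.

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details table.
last_scan = datetime.fromisoformat("2026-02-07T00:34:06+00:00")
next_scan = datetime.fromisoformat("2026-03-09T00:34:06+00:00")

# Gap between the last scan and the scheduled next scan.
interval = next_scan - last_scan
print(interval.days)  # 30


def following_scan(previous: datetime, gap: timedelta) -> datetime:
    """Hypothetical helper: project a later scan, assuming a fixed gap."""
    return previous + gap
```

Here `following_scan(last_scan, interval)` reproduces the listed Next Scan value.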

Last Scan

Scanned	2026-02-07T00:34:06+00:00
URL	https://arsbonus.com/robots.txt
Domain IPs	104.21.72.109, 172.67.181.173, 2606:4700:3030::6815:486d, 2606:4700:3037::ac43:b5ad
Response IP	172.67.181.173
Found	Yes
Hash	0e0a841a41102074fe757d20d9acbb408561666fad3c1c6c1e3afaff24cb0d92
SimHash	0e1ed45083f3
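
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the report does not name the algorithm. Under that assumption, a fetched copy of the file could be compared against the recorded hash roughly as follows. The function name `body_matches_hash` is hypothetical, and the report does not say which bytes (raw or normalized) were hashed.

```python
import hashlib


def body_matches_hash(body: bytes, expected_hex: str) -> bool:
    """Return True if the SHA-256 hex digest of body equals expected_hex.

    Assumption: the scanner hashes the response body with SHA-256.
    """
    return hashlib.sha256(body).hexdigest() == expected_hex.strip().lower()
```

A mismatch would not necessarily mean the file changed; it could also mean the assumption about the algorithm or the hashed bytes is wrong.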

Groups

facebookexternalhit

Rule	Path
Allow	/

facebookexternalhit

Rule	Path
Allow	/

facebookcatalog

Rule	Path
Allow	/

*

Rule	Path
Disallow	/

Other Records

Field	Value
crawl-delay	10

googlebot

Rule	Path
Disallow	/

bingbot

Rule	Path
Disallow	/

slurp

Rule	Path
Disallow	/

duckduckbot

Rule	Path
Disallow	/

baiduspider

Rule	Path
Disallow	/

yandexbot

Rule	Path
Disallow	/

sogou

Rule	Path
Disallow	/

ia_archiver

Rule	Path
Disallow	/

facebot

Rule	Path
Allow	/

applebot

Rule	Path
Disallow	/

twitterbot

Rule	Path
Disallow	/

linkedinbot

Rule	Path
Disallow	/

archive.org_bot

Rule	Path
Disallow	/

httrack

Rule	Path
Disallow	/

wget

Rule	Path
Disallow	/

curl

Rule	Path
Disallow	/

Comments

Explicitly block archive.org
Block automated tools
Notify that this site does not want to be scraped
Note: This is not a standard robots.txt directive but some crawlers respect it

Warnings

`noindex` is not a known field.
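
To see how the groups listed in this report would be evaluated by a client, the sketch below feeds a partial, assumed reconstruction of the file to Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text is rebuilt from the Groups tables and is not the original file; only three of the groups are included, and `ExampleBot` and the helper `allowed` are hypothetical names used for illustration.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of part of the file, based on the Groups tables.
# The real robots.txt may order or spell these records differently.
ROBOTS_TXT = """\
User-agent: facebookexternalhit
Allow: /

User-agent: *
Disallow: /
Crawl-delay: 10

User-agent: Googlebot
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, url: str = "https://arsbonus.com/") -> bool:
    """Return whether the given user agent may fetch url under these rules."""
    return parser.can_fetch(agent, url)


print(allowed("facebookexternalhit"))    # True
print(allowed("Googlebot"))              # False
print(allowed("ExampleBot"))             # False (falls back to the * group)
print(parser.crawl_delay("ExampleBot"))  # 10 (from the * group)
```

Other parsers may resolve overlapping groups differently, so this reflects the behavior of `urllib.robotparser` only.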
