captain-tsubasa.com
robots.txt

Robots Exclusion Standard data for captain-tsubasa.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	captain-tsubasa.com
Base Domain	captain-tsubasa.com
Scan Status	Ok
Last Scan	2025-12-14T05:59:05+00:00
Next Scan	2025-12-28T05:59:05+00:00

Last Scan

Scanned	2025-12-14T05:59:05+00:00
URL	https://captain-tsubasa.com/robots.txt
Domain IPs	18.176.100.108
Response IP	18.176.100.108
Found	Yes
Hash	5dab329bca61b0cf021b875d6920529be9fac5cae8cf7c4dd04be9073f12ed01
SimHash	e10dc8e2c3d1

Groups

*

Rule	Path
Allow	/n/*
Allow	/m/*
Allow	/p/*
Allow	/archives/*
Allow	/followings
Allow	/followers
Allow	/likes
Allow	/membership/*
Allow	/sitemap.xml.gz
Disallow	/*/
Disallow	/embed/*
Disallow	/intent/*
Disallow	/m/*/archive

Rule

Path

Allow

/n/*

Allow

/m/*

Allow

/p/*

Allow

/archives/*

Allow

/followings

Allow

/followers

Allow

/likes

Allow

/membership/*

Allow

/sitemap.xml.gz

Disallow

/*/

Disallow

/embed/*

Disallow

/intent/*

Disallow

/m/*/archive

bingbot

Rule	Path
Allow	/n/*
Allow	/m/*
Allow	/p/*
Allow	/archives/*
Allow	/followings
Allow	/followers
Allow	/likes
Allow	/sitemap.xml.gz
Disallow	/*/
Disallow	/embed/*
Disallow	/intent/*

Rule

Path

Allow

/n/*

Allow

/m/*

Allow

/p/*

Allow

/archives/*

Allow

/followings

Allow

/followers

Allow

/likes

Allow

/sitemap.xml.gz

Disallow

/*/

Disallow

/embed/*

Disallow

/intent/*

Other Records

Field	Value
crawl-delay	30

Field

Value

crawl-delay

30

megalodon

Rule	Path
Disallow	/

Rule

Path

Disallow

/

ia_archiver

Rule	Path
Disallow	/

Rule

Path

Disallow

/

Back to top

Other Records

Field	Value
sitemap	https://captain-tsubasa.com/sitemap.xml.gz

Field

Value

sitemap

https://captain-tsubasa.com/sitemap.xml.gz

Back to top

captain-tsubasa.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

bingbot

Other Records

megalodon

ia_archiver

Other Records

captain-tsubasa.com
robots.txt