scrapy.org
robots.txt

Robots Exclusion Standard data for scrapy.org

Archived Snapshots

Resource Scan

Scan Details

Site Domain	scrapy.org
Base Domain	scrapy.org
Scan Status	Ok
Last Scan	2025-10-23T10:21:33+00:00
Next Scan	2025-11-22T10:21:33+00:00

Last Scan

Scanned	2025-10-23T10:21:33+00:00
URL	https://scrapy.org/robots.txt
Redirect	https://www.scrapy.org/robots.txt
Redirect Domain	www.scrapy.org
Redirect Base	scrapy.org
Domain IPs	76.76.21.21
Redirect IPs	66.33.60.193, 66.33.60.35
Response IP	76.76.21.142
Found	Yes
Hash	9a2512878f3d4db9aa923304dcfe05c9cb5cc0dd397c114f2bd62c09ac4062b6
SimHash	479509517556

Groups

googlebot

Rule	Path
Allow	/

Rule

Path

Allow

/

google-extended

Rule	Path
Allow	/

Rule

Path

Allow

/

bingbot

Rule	Path
Allow	/

Rule

Path

Allow

/

bingpreview

Rule	Path
Allow	/

Rule

Path

Allow

/

gptbot

Rule	Path
Allow	/

Rule

Path

Allow

/

ccbot

Rule	Path
Allow	/

Rule

Path

Allow

/

chatgpt-user

Rule	Path
Allow	/

Rule

Path

Allow

/

perplexitybot

Rule	Path
Allow	/

Rule

Path

Allow

/

Back to top

Other Records

Field	Value
sitemap	https://scrapy.org/sitemap.xml

Field

Value

sitemap

https://scrapy.org/sitemap.xml

Back to top

Comments

robots.txt for Scrapy Redesign (private/staging)

Back to top

scrapy.orgrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

googlebot

google-extended

bingbot

bingpreview

gptbot

ccbot

chatgpt-user

perplexitybot

Other Records

Comments

scrapy.org
robots.txt