theverge.com
robots.txt

Robots Exclusion Standard data for theverge.com

Resource Scan

Scan Details

Site Domain theverge.com
Base Domain theverge.com
Scan Status Ok
Last Scan2024-06-18T15:35:57+00:00
Next Scan 2024-06-25T15:35:57+00:00

Last Scan

Scanned2024-06-18T15:35:57+00:00
URL https://theverge.com/robots.txt
Redirect https://www.theverge.com/robots.txt
Redirect Domain www.theverge.com
Redirect Base theverge.com
Domain IPs 151.101.1.91, 151.101.129.91, 151.101.193.91, 151.101.65.91
Redirect IPs 151.101.1.91, 151.101.129.91, 151.101.193.91, 151.101.65.91
Response IP 199.232.45.91
Found Yes
Hash de615d7900c8cead3c0c827aa64b349fef8c857a4de1f114407d113d1a2e9f9a
SimHash 70985940edd3

Groups

googlebot-news

Rule Path
Disallow /admin
Disallow /newfanshot
Disallow /users/*/replies
Disallow /users/*/comments
Disallow /search
Disallow /account
Disallow /login
Disallow /chorus_auth
Disallow /sso
Disallow /ad
Disallow /sponsored

*

Rule Path
Disallow /admin
Disallow /newfanshot
Disallow /users/*/replies
Disallow /users/*/comments
Disallow /search
Disallow /account
Disallow /login
Disallow /chorus_auth
Disallow /sso
Disallow */archives/*/archives*

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

youbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.theverge.com/sitemaps
sitemap https://www.theverge.com/sitemaps/authors
sitemap https://www.theverge.com/sitemaps/groups
sitemap https://www.theverge.com/sitemaps/videos
sitemap https://www.theverge.com/sitemaps/google_news