gizmo.do
robots.txt

Robots Exclusion Standard data for gizmo.do

Resource Scan

Scan Details

Site Domain gizmo.do
Base Domain gizmo.do
Scan Status Ok
Last Scan2025-09-15T06:36:00+00:00
Next Scan 2025-09-22T06:36:00+00:00

Last Scan

Scanned2025-09-15T06:36:00+00:00
URL http://www.gizmo.do/robots.txt
Redirect https://gizmodo.com/robots.txt
Redirect Domain gizmodo.com
Redirect Base gizmodo.com
Domain IPs 185.26.106.234
Redirect IPs 104.26.2.63, 104.26.3.63, 172.67.74.173, 2606:4700:20::681a:23f, 2606:4700:20::681a:33f, 2606:4700:20::ac43:4aad
Response IP 104.26.2.63
Found Yes
Hash 974bd0918c3b6a9a3fa82751f203e1c9c409c9804b5c91b57b1b84c25c972a0b
SimHash 71101011a686

Groups

*

Rule Path
Disallow /stats/
Disallow /api/
Disallow /ajax/
Disallow /embed/
Disallow /setbucket*
Disallow /game/score/*
Disallow /game/summary/*
Disallow /gateway/*
Disallow /showcase/*
Disallow /search$
Disallow /search?
Disallow /?s=
Disallow /search/
Disallow /wp/wp-admin/
Allow /wp/wp-admin/admin-ajax.php

googlebot
bingbot

Rule Path
Allow /

oai-searchbot
chatgpt-user
claude-user
claude-searchbot
perplexitybot

Rule Path
Allow /

ai2bot
amazonbot
anthropic-ai
andibot
applebot-extended
bytespider
ccbot
claudebot
cohere-ai
diffbot
facebookbot
firecrawlagent
google-extended
google-cloudvertexbot
gptbot
imagesiftbot
kangaroo bot
meta-externalagent
mistralai-user
omgilibot
pangubot
timpibot
webzio-extended
youbot

Rule Path
Disallow /

httrack
nutch
scrapy

Rule Path
Disallow /

Other Records

Field Value
sitemap https://gizmodo.com/sitemap_index.xml
sitemap https://gizmodo.com/sitemap-news.xml
sitemap https://gizmodo.com/download/downloads.xml.gz

Comments

  • Allow traditional search indexing
  • Allow AI search and agent use
  • Disallow AI training data collection
  • Disallow site scrapers