gadgetbond.com
robots.txt

Robots Exclusion Standard data for gadgetbond.com

Resource Scan

Scan Details

Site Domain gadgetbond.com
Base Domain gadgetbond.com
Scan Status Failed
Failure Reason Scan timed out.
Last Scan 2025-10-12T23:20:23+00:00
Next Scan 2025-12-11T23:20:23+00:00

Last Successful Scan

Scanned 2025-08-20T19:22:49+00:00
URL https://gadgetbond.com/robots.txt
Domain IPs 195.179.239.193, 2a02:4780:1:1381:0:13cb:892c:2
Response IP 195.179.239.193
Found Yes
Hash 3024a8979f1e73d40df21fd500d6680c2471baa93322646ca3c1ed438fc5ee8c
SimHash 788e1943c4a4

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /search
Disallow /?s=
Disallow /wp-login.php
Disallow /admin/
Disallow /login/
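
The rules in this group overlap: /wp-admin/ is disallowed while /wp-admin/admin-ajax.php is allowed. A minimal sketch of how a crawler following the longest-match convention of RFC 9309 might resolve that is shown below. The function name, the rule list layout, and the sample paths are assumptions made for illustration, and the sketch handles plain prefixes only (no wildcards).

```python
def is_allowed(rules, path):
    """Return True if `path` may be crawled under `rules`.

    `rules` is a list of (directive, prefix) pairs. The longest matching
    prefix wins; on a tie, Allow wins; with no match, crawling is allowed.
    """
    best_len = -1
    allowed = True
    for directive, prefix in rules:
        # An empty prefix matches nothing; skip rules that do not apply.
        if not prefix or not path.startswith(prefix):
            continue
        is_allow = directive.lower() == "allow"
        if len(prefix) > best_len or (len(prefix) == best_len and is_allow):
            best_len = len(prefix)
            allowed = is_allow
    return allowed


# The "*" group as listed in the scan above.
STAR_GROUP = [
    ("Disallow", "/wp-admin/"),
    ("Allow", "/wp-admin/admin-ajax.php"),
    ("Disallow", "/search"),
    ("Disallow", "/?s="),
    ("Disallow", "/wp-login.php"),
    ("Disallow", "/admin/"),
    ("Disallow", "/login/"),
]

# Hypothetical request paths, chosen only to exercise the rules.
for sample in ("/wp-admin/admin-ajax.php", "/wp-admin/options.php", "/?s=pixel"):
    print(sample, is_allowed(STAR_GROUP, sample))
```

Under this convention the Allow rule wins for admin-ajax.php because its prefix is longer than /wp-admin/. Parsers that apply the first matching rule in file order instead would reach the opposite result for that path.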

mediapartners-google

Rule Path
Disallow

google-display-ads-bot

Rule Path
Disallow
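
A Disallow rule with an empty path, as in the two groups above, blocks nothing, so the named crawler may fetch every URL. The sketch below checks that reading with Python's standard urllib.robotparser. The SAMPLE text is a reduced stand-in written for this example, not the site's actual file, and the URL and the "examplebot" agent name are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reduced stand-in for illustration only; not the real gadgetbond.com file.
SAMPLE = """\
User-agent: *
Disallow: /search

User-agent: mediapartners-google
Disallow:
"""

parser = RobotFileParser()
parser.parse(SAMPLE.splitlines())

URL = "https://gadgetbond.com/search?q=phones"  # hypothetical URL

# The named group has an empty Disallow, so nothing is blocked for it.
ads_bot_allowed = parser.can_fetch("mediapartners-google", URL)

# Any other agent falls back to the "*" group, which blocks /search.
other_bot_allowed = parser.can_fetch("examplebot", URL)

print(ads_bot_allowed, other_bot_allowed)
```

Because a crawler that matches a named group uses that group rather than the "*" group, the empty Disallow also lifts the general restrictions for these two Google agents.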

google-extended

Rule Path
Allow /

google-cloudvertexbot

Rule Path
Allow /

amazonbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

applebot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

awariorssbot

Rule Path
Disallow /

awariosmartbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

friendlycrawler

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

newsnow

Rule Path
Disallow /

news-please

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

peer39_crawler

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

quora-bot

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

youbot

Rule Path
Disallow /

facebookexternalhit

Rule Path
Allow /*?*smid=

twitterbot

Rule Path
Allow /*?*smid=
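
The path /*?*smid= uses the * wildcard, which matches any run of characters, so the rule covers any URL whose query string contains smid=. The following is a rough sketch of how such a pattern could be translated into a regular expression, assuming RFC 9309 wildcard semantics (* matches any sequence, a trailing $ anchors the end). The helper names and sample URLs are hypothetical.

```python
import re


def pattern_to_regex(pattern):
    """Compile a robots.txt path pattern into a regular expression."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with ".*" for each "*".
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def matches(regex, path_and_query):
    """Return True if the pattern matches from the start of the path."""
    return regex.match(path_and_query) is not None


SMID_RULE = pattern_to_regex("/*?*smid=")

# Hypothetical paths, chosen only to exercise the pattern.
for sample in ("/article?smid=tw", "/article?utm=x&smid=fb", "/article?id=5"):
    print(sample, matches(SMID_RULE, sample))
```

Neither of these two groups contains a Disallow rule, so under the same convention every path is already permitted for them and the Allow line mainly documents intent.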

Other Records

Field Value
sitemap https://gadgetbond.com/sitemap.xml

Comments

  • robots.txt
  • This file prevents crawling and indexing of certain parts
  • of your site by web crawlers and spiders.
  • It helps save bandwidth and server resources.
  • General Rules for All Bots
  • Google Services (Ensuring Ads and AI Features Work)
  • Block AI Crawlers & Data-Scraping Bots
  • Social Media Bots (Allow for Rich Previews)
  • Sitemaps