allaboutnoise.com
robots.txt

Robots Exclusion Standard data for allaboutnoise.com

Resource Scan

Scan Details

Site Domain allaboutnoise.com
Base Domain allaboutnoise.com
Scan Status Ok
Last Scan2025-11-23T07:56:15+00:00
Next Scan 2025-12-23T07:56:15+00:00

Last Scan

Scanned2025-11-23T07:56:15+00:00
URL https://allaboutnoise.com/robots.txt
Redirect https://www.allaboutnoise.com/robots.txt
Redirect Domain www.allaboutnoise.com
Redirect Base allaboutnoise.com
Domain IPs 208.77.146.68
Redirect IPs 208.77.146.68
Response IP 208.77.146.68
Found Yes
Hash 57b44101403d2cf25d3412d885a05dc45e2184bd8ddcd2117ba79b54e413d516
SimHash 403bd250e58b

Groups

googlebot

Rule Path
Allow /

bingbot

Rule Path
Allow /

duckduckbot

Rule Path
Allow /

applebot

Rule Path
Allow /

yahoo

Rule Path
Allow /

yandex

Rule Path
Allow /

baiduspider

Rule Path
Allow /

grequests

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

facebookexternalhit

Rule Path
Disallow /

meta-externalfetcher

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

*

Rule Path
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /tmp/
Disallow /private/
Disallow /config/

Comments

  • Generated robots.txt by BlueOnyx
  • Allow well-known search engines and reputable crawlers
  • Block known unwanted bots and scrapers
  • General restrictions for all other bots