waynemadsenreport.com
robots.txt

Robots Exclusion Standard data for waynemadsenreport.com

Resource Scan

Scan Details

Site Domain waynemadsenreport.com
Base Domain waynemadsenreport.com
Scan Status Ok
Last Scan 2024-09-21T00:46:17+00:00
Next Scan 2024-10-21T00:46:17+00:00

Last Scan

Scanned 2024-09-21T00:46:17+00:00
URL https://waynemadsenreport.com/robots.txt
Domain IPs 104.21.79.56, 172.67.142.94, 2606:4700:3032::ac43:8e5e, 2606:4700:3036::6815:4f38
Response IP 172.67.142.94
Found Yes
Hash 3e70f41a91a52bce89146380a853e2d9119b66b1ccd3b17771a21bc067122c65
SimHash 86399c4a8911

Groups

*

Rule Path
Disallow /*print
Disallow /downloads/*
Disallow /css/*
Disallow /js/*
Disallow /extensions/*
Disallow /design/*
Disallow /custom/*
Disallow /sendfriend/*
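
The paths in the group above use the `*` wildcard. As a rough illustration of how such a rule could be matched against a URL path, here is a minimal sketch that assumes the widely used convention where `*` matches any run of characters and a trailing `$` anchors the end of the path. The helper names `rule_to_regex` and `rule_matches` are hypothetical, and individual crawlers may interpret these rules differently.

```python
import re


def rule_to_regex(rule_path: str) -> re.Pattern:
    """Translate a robots.txt rule path into a compiled regex (assumed semantics)."""
    # A trailing '$' anchors the rule to the end of the URL path.
    anchored = rule_path.endswith("$")
    if anchored:
        rule_path = rule_path[:-1]
    # Escape the literal pieces and let each '*' match any run of characters.
    pattern = ".*".join(re.escape(part) for part in rule_path.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))


def rule_matches(rule_path: str, url_path: str) -> bool:
    """Return True if url_path falls under rule_path (match starts at the path's beginning)."""
    return rule_to_regex(rule_path).match(url_path) is not None


if __name__ == "__main__":
    for rule, path in [
        ("/*print", "/article/print"),
        ("/downloads/*", "/downloads/file.zip"),
        ("/css/*", "/index.html"),
    ]:
        print(rule, path, rule_matches(rule, path))
```

Under these assumed semantics, `/*print` matches any path containing `print` after the leading slash, so it would also cover a path such as `/blueprint`.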

teleport

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

bubing

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

webbandit

Rule Path
Disallow /

webzip

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

web downloader

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

offline explorer pro

Rule Path
Disallow /

httrack website copier

Rule Path
Disallow /

offline commander

Rule Path
Disallow /

leech

Rule Path
Disallow /

websnake

Rule Path
Disallow /

blackwidow

Rule Path
Disallow /

http weazel

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

yandex

Rule Path
Disallow /
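
To show how the per-agent groups above might be evaluated, the following sketch feeds a small excerpt reconstructed from this report into Python's standard `urllib.robotparser`. The excerpt is an assumption about the file's layout, not the original file, and the agent strings passed to `can_fetch` are illustrative. The standard-library parser does plain prefix matching, so this example only exercises the blanket `Disallow /` groups and not the wildcard paths.

```python
from urllib.robotparser import RobotFileParser

# Excerpt reconstructed from the scan report (assumed layout, not the original file).
ROBOTS_TXT = """\
User-agent: *
Disallow: /downloads/*

User-agent: ahrefsbot
Disallow: /

User-agent: yandex
Disallow: /
"""

SITE = "https://waynemadsenreport.com"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text held in memory instead of fetching it over the network."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


if __name__ == "__main__":
    parser = build_parser(ROBOTS_TXT)
    # Agents with their own "Disallow /" group are blocked from the whole site.
    print(parser.can_fetch("AhrefsBot", SITE + "/"))
    # An agent with no dedicated group falls back to the "*" group.
    print(parser.can_fetch("ExampleBot", SITE + "/index.html"))
```

Agent names are compared case-insensitively by this parser, which is consistent with the lowercase group names shown in the report.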

Other Records

Field Value
sitemap /sitemap.xml
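
The sitemap value is recorded as a path rather than a full URL. One way a consumer might resolve it is against the scanned robots.txt URL, as in this small sketch. The helper name `resolve_sitemap` is hypothetical, and whether the resulting URL actually serves a sitemap is not confirmed by this report.

```python
from urllib.parse import urljoin

ROBOTS_URL = "https://waynemadsenreport.com/robots.txt"  # URL from the scan details
SITEMAP_VALUE = "/sitemap.xml"                           # value from Other Records


def resolve_sitemap(robots_url: str, value: str) -> str:
    """Resolve a sitemap value against the robots.txt URL; absolute URLs pass through unchanged."""
    return urljoin(robots_url, value)


if __name__ == "__main__":
    print(resolve_sitemap(ROBOTS_URL, SITEMAP_VALUE))
```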

Comments

  • This list is compiled by Techie Zone part of Qlogix Network.
  • Baiduspider
  • Yandex