backlinkaz.com
robots.txt

Robots Exclusion Standard data for backlinkaz.com

Resource Scan

Scan Details

Site Domain backlinkaz.com
Base Domain backlinkaz.com
Scan Status Ok
Last Scan2024-11-07T16:34:53+00:00
Next Scan 2024-12-07T16:34:53+00:00

Last Scan

Scanned2024-11-07T16:34:53+00:00
URL https://backlinkaz.com/robots.txt
Domain IPs 104.21.73.108, 172.67.189.169, 2606:4700:3036::ac43:bda9, 2606:4700:3037::6815:496c
Response IP 104.21.73.108
Found Yes
Hash e76323b02fb27156af3dadb96e6b54c70dcb3bc644c63273c7f578b792ef77ed
SimHash 5309c8427db3

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Allow /wp-admin/images/*
Disallow /wp-includes/
Allow /wp-includes/js
Allow /wp-includes/css
Disallow /trackback/
Disallow /xmlrpc.php
Disallow /*/feed/*
Disallow /*/*?s=*
Disallow /*/*.inc$
Disallow /transfer/
Disallow /refer/
Disallow /*/cgi-bin/*
Disallow /*/blackhole/*
Disallow /*/trackback/*
Disallow /*/xmlrpc.php
Disallow /suggest/?*
Disallow /readme.html
Disallow /*?hpp_next=*

baiduspider
easouspider
ezooms
yandexbot
mj12bot
sitesucker
httrack
httrack website copier
teleport
teleportpro
emailcollector
emailsiphon
webbandit
webzip
webreaper
webstripper
web downloader
webcopier
offline explorer pro
offline commander
leech
websnake
blackwidow
http weazel

No rules defined. All paths allowed.

Other Records

Field Value
sitemap https://backlinkaz.com/sitemap_index.xml

Comments

  • Disallow: /wp-content/plugins/
  • Disallow: /wp-content/themes/
  • protect my site from HTTrack or other software's ripping?