agrtech.com.au
robots.txt

Robots Exclusion Standard data for agrtech.com.au

Resource Scan

Scan Details

Site Domain agrtech.com.au
Base Domain agrtech.com.au
Scan Status Ok
Last Scan2024-09-15T14:52:09+00:00
Next Scan 2024-10-15T14:52:09+00:00

Last Scan

Scanned2024-09-15T14:52:09+00:00
URL https://agrtech.com.au/robots.txt
Domain IPs 192.124.249.68
Response IP 192.124.249.68
Found Yes
Hash 03fc097544b73c14bfbc4e238923f1b41bf9f89954c683febea0f063282f844c
SimHash b0105d03c764

Groups

*

Rule Path
Allow /wp-admin/admin-ajax.php
Disallow /xmlrpc.php
Disallow /cgi-bin
Disallow /wp-admin$
Disallow /wp-admin*/
Allow /author/admin/
Disallow /category/uncategorized$
Disallow /category/uncategorized/
Disallow */trackback/
Disallow /print$
Disallow /print/
Disallow /search$
Disallow /search/
Allow /wp-admin/admin-ajax.php
Allow /*.js
Allow /*.css
Disallow /xmlrpc.php$
Disallow /recommends/
Disallow /assets
Allow /assets/brand

googlebot-image

Rule Path
Allow /wp-content/uploads/

twitterbot

Rule Path
Allow /

baiduspider-image

Rule Path
Allow /wp-content/uploads/

ninjabot

Rule Path
Allow /

rogerbot

Rule Path
Disallow /

voltron

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

fetch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

stress-agent

Rule Path
Disallow /

mediapartners-google*

Rule Path
Disallow /

teleport

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webzip

Rule Path
Disallow /

linko

Rule Path
Disallow /

larbin

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

npbot

Rule Path
Disallow /

siteexplorer

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

wesee

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

fast

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

ia_archiver-web.archive.org

Rule Path
Disallow

Other Records

Field Value
sitemap https://agrtech.com.au/sitemap_index.xml
sitemap https://agrtech.com.au/sitemap.xml
sitemap https://agrtech.com.au/post-sitemap.xml
sitemap https://agrtech.com.au/page-sitemap.xml

Comments

  • Robots.txt file for AGR Technology
  • This file is to prevent the crawling and indexing of certain parts
  • of your site by web crawlers and spiders run by sites like Yahoo!
  • and Google. By telling these "robots" where not to go on your site,
  • you save bandwidth and server resources.
  • This file will be ignored unless it is at the root of your host:
  • Used: http://example.com/robots.txt
  • Ignored: http://example.com/site/robots.txt
  • For more information about the robots.txt standard, see:
  • http://www.robotstxt.org/robotstxt.html
  • ____________________________
  • !\_________________________/!\
  • !! !! \
  • !! !! \
  • !! C:\AGR_Technology.exe !! !
  • !! !! !
  • !! !! !
  • !! !! !
  • !! !! !
  • !! !! /
  • !!_________________________!! /
  • !/_________________________\!/
  • __\_________________/__/!_
  • !_______________________!/ )
  • ________________________ (__
  • /oooo oooo oooo oooo /! _ )_
  • /ooooooooooooooooooooooo/ / (_)_(_)
  • /ooooooooooooooooooooooo/ / (o o)
  • /_______________________/_/ ==\o/==
  • Sitemaps
  • 80legs
  • disallow stress test
  • advertising-related bots:
  • Misbehaving: requests much too fast:

Warnings

  • 2 invalid lines.