thehia.org
robots.txt

Robots Exclusion Standard data for thehia.org

Resource Scan

Scan Details

Site Domain thehia.org
Base Domain thehia.org
Scan Status Ok
Last Scan2024-09-28T08:42:00+00:00
Next Scan 2024-10-28T08:42:00+00:00

Last Scan

Scanned2024-09-28T08:42:00+00:00
URL https://thehia.org/robots.txt
Domain IPs 162.159.135.42
Response IP 162.159.135.42
Found Yes
Hash 927905b0498587ac98b771c49950f6f2941650273d61a0f7081e0deff8988942
SimHash 609959408ce0

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /wp-
Disallow /?s=
Disallow *%26s%3D
Disallow /search
Disallow /author/
Disallow *?attachment_id=
Disallow */feed
Disallow */rss
Disallow */embed
Disallow /wp-admin/
Disallow /wp-content/uploads/wpforms/
Allow /wp-content/uploads/
Allow /wp-content/themes/
Allow /wp-admin/admin-ajax.php
Allow /*/*.js
Allow /*/*.css
Allow /wp-*.png
Allow /wp-*.jpg
Allow /wp-*.jpeg
Allow /wp-*.gif
Allow /wp-*.svg
Allow /wp-*.pdf

*

Rule Path
Disallow /?s=
Disallow /search

*

Rule Path
Disallow /trackback
Disallow /*trackback
Disallow /*trackback*
Disallow /*/trackback

*

Rule Path
Allow /feed/$
Disallow /feed/
Disallow /comments/feed/
Disallow /*/feed/$
Disallow /*/feed/rss/$
Disallow /*/trackback/$
Disallow /*/*/feed/$
Disallow /*/*/feed/rss/$
Disallow /*/*/trackback/$
Disallow /*/*/*/feed/$
Disallow /*/*/*/feed/rss/$
Disallow /*/*/*/trackback/$

msiecrawler

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

libwww

Rule Path
Disallow /

orthogaffe

Rule Path
Disallow /

ubicrawler

Rule Path
Disallow /

doc

Rule Path
Disallow /

zao

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

fetch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webzip

Rule Path
Disallow /

linko

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

xenu

Rule Path
Disallow /

larbin

Rule Path
Disallow /

libwww

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

wget

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

k2spider

Rule Path
Disallow /

npbot

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

googlebot

Rule Path
Allow /*.css$
Allow /*.js$

Other Records

Field Value
sitemap https://thehia.org/sitemap.xml
sitemap https://thehia.org/sitemap.rss

Comments

  • robots.txt optimizations of thehia.org
  • Search blocking
  • Trackback blocking
  • RSS Feed Blocking
  • Blocking Bad bots and Bad Search Crawlers
  • Prevents resource problems blocked in Google Search Console
  • Optimized Sitemap of Your Website.
  • Website Search Engine Optimizations by https://www.fiverr.com/glaxosoft