rgei.com
robots.txt

Robots Exclusion Standard data for rgei.com

Resource Scan

Scan Details

Site Domain rgei.com
Base Domain rgei.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-02-07T07:30:28+00:00
Next Scan 2025-05-08T07:30:28+00:00

Last Successful Scan

Scanned 2024-06-19T23:33:32+00:00
URL https://rgei.com/robots.txt
Redirect https://www.rgei.com/robots.txt
Redirect Domain www.rgei.com
Redirect Base rgei.com
Domain IPs 13.251.4.136
Redirect IPs 2600:9000:271a:2000:12:96a4:4940:93a1, 2600:9000:271a:2c00:12:96a4:4940:93a1, 2600:9000:271a:7e00:12:96a4:4940:93a1, 2600:9000:271a:8a00:12:96a4:4940:93a1, 2600:9000:271a:ac00:12:96a4:4940:93a1, 2600:9000:271a:b200:12:96a4:4940:93a1, 2600:9000:271a:e000:12:96a4:4940:93a1, 2600:9000:271a:fc00:12:96a4:4940:93a1, 3.165.82.115, 3.165.82.30, 3.165.82.82, 3.165.82.96
Response IP 3.165.82.30
Found Yes
Hash f12a4614c29fe351ef2fd371ad316b06613ec46a6d034176ae4b4f89abe123bb
SimHash eb1c1d5983f2

Groups

httrack

Rule Path
Disallow /

*

Rule Path
Disallow /administrator/
Disallow /bin/
Disallow /cache/
Disallow /cli/
Disallow /components/
Disallow /includes/
Disallow /installation/
Disallow /language/
Disallow /layouts/
Disallow /libraries/
Disallow /logs/
Disallow /tmp/
Disallow /component/content/article?id=
Disallow /component/content/?id=
Disallow /group-companies-sp-373/pacific-oil-a-gas
Disallow /news-a-information/
Disallow /news-a-information?id=41
Disallow /component/content/category/
Disallow /news-a-information?id=
Disallow /about-us-sp-303/founder
Disallow /index.php?option=com_content&view=article&id=11&Itemid=57
Disallow /index.php?option=com_content&view=article&id=17&Itemid=6
Disallow /index.php?option=com_content&view=article&id=18&Itemid=7
Disallow /index.php?option=com_content&view=article&id=6&Itemid=50
Disallow /index.php?option=com_content&view=article&id=7&I
Disallow /component/content/article/8-home/31-footer-text
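The two groups above can be checked against concrete URLs. The following is a minimal sketch using Python's standard urllib.robotparser. It assumes the rules were written in ordinary "Field: value" robots.txt syntax, reproduces only a few of the listed paths, and uses a hypothetical crawler name (ExampleBot) to stand in for any agent matched by the * group.

```python
from urllib.robotparser import RobotFileParser

# A small subset of the rules listed in the report, rewritten in
# standard robots.txt syntax (assumption: the original file used
# "User-agent:" / "Disallow:" lines for these groups).
ROBOTS_TXT = """\
User-agent: httrack
Disallow: /

User-agent: *
Disallow: /administrator/
Disallow: /tmp/
Disallow: /news-a-information/
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text held in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# httrack has its own group that disallows the whole site.
httrack_home = parser.can_fetch("httrack", "https://www.rgei.com/")

# "ExampleBot" is a hypothetical agent; it falls back to the * group.
bot_admin = parser.can_fetch(
    "ExampleBot", "https://www.rgei.com/administrator/index.php"
)
bot_about = parser.can_fetch("ExampleBot", "https://www.rgei.com/about-us-sp-303")

print(httrack_home, bot_admin, bot_about)
```

Matching here is by simple path prefix, so /about-us-sp-303 is allowed for the * group even though the longer path /about-us-sp-303/founder is listed as disallowed in the report.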

Other Records

Field Value
sitemap http://www.rgei.com/sitemap.xml

Comments

  • If the Joomla site is installed within a folder, e.g. www.example.com/joomla/, then the robots.txt file MUST be moved to the site root, e.g. www.example.com/robots.txt, AND the joomla folder name MUST be prefixed to all of the paths.
  • E.g. the Disallow rule for the /administrator/ folder MUST be changed to read: Disallow: /joomla/administrator/
  • For more information about the robots.txt standard, see: http://www.robotstxt.org/orig.html
  • For syntax checking, see: http://tool.motoricerca.info/robots-checker.phtml
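The folder-prefix adjustment described in the comments can be sketched as a small text transformation. This is an illustrative helper only, not part of Joomla or of the scanned file: the function name prefix_rule_paths is hypothetical, and treating Allow lines the same way as Disallow lines is an assumption based on the comment's wording "all of the paths".

```python
def prefix_rule_paths(robots_txt: str, folder: str) -> str:
    """Prefix every Allow/Disallow path with the given folder name.

    Hypothetical helper: lines that are not path rules (User-agent,
    Sitemap, comments, blank lines, empty Disallow) are left unchanged.
    """
    prefix = "/" + folder.strip("/")
    result = []
    for line in robots_txt.splitlines():
        field, sep, value = line.partition(":")
        path = value.strip()
        is_rule = field.strip().lower() in ("allow", "disallow")
        if sep and is_rule and path.startswith("/"):
            result.append(f"{field}: {prefix}{path}")
        else:
            result.append(line)
    return "\n".join(result)


original = "User-agent: *\nDisallow: /administrator/\nDisallow: /tmp/"
moved = prefix_rule_paths(original, "joomla")
print(moved)
```

With the folder name joomla, the /administrator/ rule becomes Disallow: /joomla/administrator/, matching the example given in the comments.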