nemo33.com
robots.txt

Robots Exclusion Standard data for nemo33.com

Resource Scan

Scan Details

Site Domain nemo33.com
Base Domain nemo33.com
Scan Status Ok
Last Scan 2025-10-13T06:54:53+00:00
Next Scan 2025-11-12T06:54:53+00:00

Last Scan

Scanned 2025-10-13T06:54:53+00:00
URL https://nemo33.com/robots.txt
Domain IPs 104.21.45.14, 172.67.207.128, 2606:4700:3033::ac43:cf80, 2606:4700:3034::6815:2d0e
Response IP 104.21.45.14
Found Yes
Hash 07dec1124926a56a442400c955a20d75f7d97216365ff1ff38a3be45d99afe78
SimHash a31f155c47f4

Groups

*

Rule Path
Allow /administrator/
Allow /cache/
Allow /cli/
Allow /components/
Allow /images/
Allow /includes/
Allow /installation/
Allow /language/
Allow /libraries/
Allow /logs/
Allow /media/
Allow /modules/
Allow /plugins/
Allow /templates/
Allow /tmp/

*

Rule Path
Allow /d39b017

bingbot
googlebot
slurp

Rule Path
Allow

*

Rule Path
Allow /2qfeuyd

bingbot
googlebot
slurp

Rule Path
Allow

*

Rule Path
Allow /meywsvyd

bingbot
googlebot
slurp

Rule Path
Allow

*

Rule Path
Allow /9kiqk

bingbot
googlebot
slurp

Rule Path
Allow

*

Rule Path
Allow /kosc7gu

bingbot
googlebot
slurp

Rule Path
Allow
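
The groups listed above contain only Allow rules, so they should not exclude any path from crawling. A minimal sketch of how this could be checked with Python's standard `urllib.robotparser` follows. The `ROBOTS_LINES` list is an assumed reconstruction of two of the scanned groups (the original file's exact layout is not shown in this report), and `examplebot` is a hypothetical crawler name used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of two of the scanned groups; the original
# file's exact layout is not shown in this report.
ROBOTS_LINES = [
    "User-agent: *",
    "Allow: /administrator/",
    "Allow: /cache/",
    "",
    "User-agent: bingbot",
    "User-agent: googlebot",
    "User-agent: slurp",
    "Allow:",
]

rp = RobotFileParser()
rp.parse(ROBOTS_LINES)

# "examplebot" is a hypothetical agent that falls back to the "*" group.
checks = [
    ("examplebot", "https://nemo33.com/administrator/"),
    ("examplebot", "https://nemo33.com/some-other-path/"),
    ("googlebot", "https://nemo33.com/d39b017"),
]

for agent, url in checks:
    # can_fetch() returns True when no rule excludes the URL for that agent.
    print(agent, url, rp.can_fetch(agent, url))
```

Under these assumptions every check should report that fetching is permitted, since a path that matches no Disallow rule is allowed by default.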

Other Records

Field Value
sitemap http://www.nemo33.com/d39b017/sitemap.xml
sitemap http://www.nemo33.com/2qfeuyd/sitemap.xml
sitemap http://www.nemo33.com/meywsvyd/sitemap.xml
sitemap http://www.nemo33.com/9kiqk/sitemap.xml
sitemap http://www.nemo33.com/kosc7gu/sitemap.xml

Comments

  • If the Joomla site is installed within a folder such as at
  • e.g. www.example.com/joomla/ the robots.txt file MUST be
  • moved to the site root at e.g. www.example.com/robots.txt
  • AND the joomla folder name MUST be prefixed to the disallowed
  • path, e.g. the Disallow rule for the /administrator/ folder
  • MUST be changed to read Allow: /joomla/administrator/
  • For more information about the robots.txt standard, see:
  • http://www.robotstxt.org/orig.html
  • For syntax checking, see:
  • http://www.sxw.org.uk/computing/robots/check.html
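
The path adjustment described in the comments above (prefixing the install folder name to each rule path once robots.txt sits at the site root) can be sketched as follows. This is an illustration only: `prefix_rules` is a hypothetical helper, not part of Joomla or of the scanned file, and the folder name `joomla` is taken from the example in the comments.

```python
def prefix_rules(robots_txt: str, folder: str) -> str:
    """Prefix `folder` to every Allow/Disallow path in a robots.txt body.

    Hypothetical helper for illustration; lines without a rule path
    (User-agent, Sitemap, comments, empty rules) are left unchanged.
    """
    prefix = "/" + folder.strip("/")
    out = []
    for line in robots_txt.splitlines():
        field, sep, value = line.partition(":")
        path = value.strip()
        is_rule = field.strip().lower() in ("allow", "disallow")
        # Only rewrite rule lines that carry a non-empty absolute path.
        if sep and is_rule and path.startswith("/"):
            out.append(f"{field.strip()}: {prefix}{path}")
        else:
            out.append(line)
    return "\n".join(out)


sample = "User-agent: *\nDisallow: /administrator/\nDisallow: /cache/"
print(prefix_rules(sample, "joomla"))
```

The sketch rewrites only the rule paths; moving the file itself to the site root remains a separate manual step.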