sudposteaura.fr
robots.txt

Robots Exclusion Standard data for sudposteaura.fr

Resource Scan

Scan Details

Site Domain sudposteaura.fr
Base Domain sudposteaura.fr
Scan Status Ok
Last Scan2025-11-09T19:42:02+00:00
Next Scan 2025-12-09T19:42:02+00:00

Last Scan

Scanned2025-11-09T19:42:02+00:00
URL https://sudposteaura.fr/robots.txt
Redirect https://www.sudposteaura.fr/robots.txt
Redirect Domain www.sudposteaura.fr
Redirect Base sudposteaura.fr
Domain IPs 2001:41d0:301::20, 46.105.57.169
Redirect IPs 2001:41d0:301::20, 46.105.57.169
Response IP 46.105.57.169
Found Yes
Hash b5e9d00e3078394450cf6bcfb5d6517afbc0933884e06ddfde2ce0c05101e450
SimHash ab1d1d1a4be2

Groups

googlebot

Rule Path
Disallow /site_content/tags.html*
Disallow /index.php?option=com_content*
Disallow /*.php$
Disallow /*.inc$
Disallow /*.gz$
Disallow /*?*
Disallow /*?
Disallow /*%26

baiduspider

Rule Path
Disallow /

baiduspider-image

Rule Path
Disallow /

baiduspider-mobile

Rule Path
Disallow /

baiduspider-video

Rule Path
Disallow /

baiduspider-news

Rule Path
Disallow /

baiduspider-favo

Rule Path
Disallow /

baiduspider-sfkr

Rule Path
Disallow /

baiduspider-cpro

Rule Path
Disallow /

*

Rule Path
Disallow /administrator/
Disallow /cache/
Disallow /cli/
Disallow /components/
Disallow /images/headers/
Disallow /images/phocagallery/
Disallow /includes/
Disallow /installation/
Disallow /language/
Disallow /libraries/
Disallow /logs/
Disallow /modules/
Disallow /plugins/
Disallow /tmp/
Disallow /html/
Disallow /bin/
Disallow /79-site/
Disallow /adherer/79-site/
Disallow /contacts/
Disallow /9-non-categorise/

Other Records

Field Value
sitemap https://www.sudposteaura.fr/sitemap.xml
sitemap http://www.sudposteaura.fr/sitemap.xml

Comments

  • If the Joomla site is installed within a folder
  • eg www.example.com/joomla/ then the robots.txt file
  • MUST be moved to the site root
  • eg www.example.com/robots.txt
  • AND the joomla folder name MUST be prefixed to all of the
  • paths.
  • eg the Disallow rule for the /administrator/ folder MUST
  • be changed to read
  • Disallow: /joomla/administrator/
  • For more information about the robots.txt standard, see:
  • http://www.robotstxt.org/orig.html
  • For syntax checking, see:
  • http://tool.motoricerca.info/robots-checker.phtml
  • Disallow: /*.pdf$
  • Disallow: /component/
  • Disallow: /media/