tuxmail.com
robots.txt

Robots Exclusion Standard data for tuxmail.com

Resource Scan

Scan Details

Site Domain tuxmail.com
Base Domain tuxmail.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-08-17T08:09:12+00:00
Next Scan 2025-11-15T08:09:12+00:00

Last Successful Scan

Scanned2024-07-01T02:02:16+00:00
URL http://tuxmail.com/robots.txt
Redirect https://online.pt/robots.txt
Redirect Domain online.pt
Redirect Base online.pt
Domain IPs 195.200.253.125
Redirect IPs 185.83.248.141
Response IP 185.83.248.141
Found Yes
Hash ab762c5a6337ab6a2fc734932211a1c5289b0ea3dafdc02dff96de03b874086c
SimHash 29b51d216555

Groups

*

Rule Path
Disallow /gestao/

Comments

  • ****************************************************************************
  • robots.txt
  • : Robots, spiders, and search engines use this file to detmine which
  • content they should *not* crawl while indexing your website.
  • : This system is called "The Robots Exclusion Standard."
  • : It is strongly encouraged to use a robots.txt validator to check
  • for valid syntax before any robots read it!
  • Examples:
  • Instruct all robots to stay out of the admin area.
  • : User-agent: *
  • : Disallow: /admin/
  • Restrict Google and MSN from indexing your images.
  • : User-agent: Googlebot
  • : Disallow: /images/
  • : User-agent: MSNBot
  • : Disallow: /images/
  • ****************************************************************************