internetlifeforum.com
robots.txt

Robots Exclusion Standard data for internetlifeforum.com

Resource Scan

Scan Details

Site Domain internetlifeforum.com
Base Domain internetlifeforum.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-05-29T20:29:25+00:00
Next Scan 2025-08-27T20:29:25+00:00

Last Successful Scan

Scanned2024-10-09T12:05:03+00:00
URL https://internetlifeforum.com/robots.txt
Domain IPs 104.21.6.175, 172.67.135.19, 2606:4700:3036::6815:6af, 2606:4700:3037::ac43:8713
Response IP 172.67.135.19
Found Yes
Hash 59f0505d6717e8e8add21a7125c7a38a91e92af0af69754e10b3f3d999622b4a
SimHash 761c45e1cee9

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /_sub/
Disallow /members/
Disallow /member.php
Disallow /newreply.php
Disallow /search.php
Disallow /install/
Disallow /profile.php
Disallow /register.php
Disallow /report.php

Other Records

Field Value
crawl-delay 4

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

aspiegelbot

Rule Path
Disallow /

zoominfobot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

npbot

Rule Path
Disallow /

slysearch

Rule Path
Disallow /

Other Records

Field Value
sitemap http://internetlifeforum.com/xmlsitemap.php

Comments

  • Our policy
  • Allowed:
  • - Search engine indexers
  • - Archival services (e.g. IA)
  • Disallowed:
  • - Marketing or SEO crawlers
  • - Bots which are too agressive by default. This is subjective, if you annoy
  • our sysadmins you'll be blocked.
  • https://github.com/NC3-LU/MOSP/blob/140d30d13c996689e22770a4f6811bc26f3f8f0c/mosp/templates/robots.txt
  • Reach out to opensource@nc3.lu if you have questions.