hubme.id
robots.txt

Robots Exclusion Standard data for hubme.id

Resource Scan

Scan Details

Site Domain hubme.id
Base Domain hubme.id
Scan Status Ok
Last Scan2024-09-22T13:07:23+00:00
Next Scan 2024-09-29T13:07:23+00:00

Last Scan

Scanned2024-09-22T13:07:23+00:00
URL https://hubme.id/robots.txt
Domain IPs 104.21.17.83, 172.67.175.81, 2606:4700:3030::6815:1153, 2606:4700:3031::ac43:af51
Response IP 104.21.17.83
Found Yes
Hash 72631ee598340e363c7a960261c2cf474a580959061533b945fe9b6555a02f9a
SimHash 0c1c81594de3

Groups

*

Rule Path
Allow /
Disallow /plugins/
Disallow /source/

Other Records

Field Value
sitemap https://hubme.id/sitemap/1

Comments

  • robots.txt for https://hubme.id
  • Crawlers that are kind enough to obey, but which we'd rather not have
  • unless they're feeding search engines.
  • Sitemap: https://hubme.id/sitemap/1
  • If your bot supports such a thing using the 'Crawl-delay' or another
  • instruction, please let us know. We can add it to our robots.txt.
  • Friendly, low-speed bots are welcome viewing article pages, but not
  • dynamically-generated pages please. Article pages contain our site's
  • real content.