demo-v3.katadata.co.id
robots.txt

Robots Exclusion Standard data for demo-v3.katadata.co.id

Resource Scan

Scan Details

Site Domain demo-v3.katadata.co.id
Base Domain katadata.co.id
Scan Status Ok
Last Scan 2024-04-18T01:50:14+00:00
Next Scan 2024-05-18T01:50:14+00:00

Last Scan

Scanned 2024-04-18T01:50:14+00:00
URL https://demo-v3.katadata.co.id/robots.txt
Domain IPs 172.66.40.242, 172.66.43.14, 2606:4700:3108::ac42:28f2, 2606:4700:3108::ac42:2b0e
Response IP 172.66.40.242
Found Yes
Hash 5c19b7115e2e7df12c08b9425ce3a3afcdd3f6ac3fe87dbe6c095bae999efc51
SimHash 38129d194764

Groups

*

Rule Path
Disallow *
Disallow /admincms/
Disallow /catatan.txt
Disallow /admincms
Disallow /en
Disallow /en_dev
Disallow /en_mobile
Disallow /en_mobile_dev
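
The rules in this group can be checked against a request path with a short matcher. The following is a minimal sketch, not the scanner's or any crawler's actual implementation. It assumes wildcard-aware matching in the style described by RFC 9309 and Google's documentation (`*` matches any run of characters, a trailing `$` anchors the end, everything else is a prefix match). The names `DISALLOW_RULES`, `pattern_to_regex`, and `is_disallowed` are hypothetical, as are the example paths. Parsers differ on a bare `*` that does not start with `/`, so the result for that rule should be treated as one possible interpretation.

```python
import re

# Disallow paths copied from the "*" group listed above.
DISALLOW_RULES = [
    "*",
    "/admincms/",
    "/catatan.txt",
    "/admincms",
    "/en",
    "/en_dev",
    "/en_mobile",
    "/en_mobile_dev",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Compile a robots.txt path pattern into a regex matched from the path start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal chunk; every "*" in the pattern becomes ".*".
    body = ".*".join(re.escape(chunk) for chunk in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, rules=DISALLOW_RULES) -> bool:
    """Return True if any Disallow pattern matches the beginning of the path.

    The group has no Allow rules, so a single matching Disallow is enough.
    """
    return any(pattern_to_regex(rule).match(path) for rule in rules)


if __name__ == "__main__":
    # Same rules without the bare "*", to show the effect of the path rules alone.
    path_rules = [rule for rule in DISALLOW_RULES if rule != "*"]
    for path in ("/admincms/login", "/english", "/berita/ekonomi"):
        print(path, is_disallowed(path), is_disallowed(path, path_rules))
```

Under this interpretation the bare `*` blocks every path, which makes the remaining rules redundant. Without it, `/en` still blocks any path that merely begins with those characters (for example `/english`), because matching is by prefix rather than by directory.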

Comments

  • robots.txt
  • This file is to prevent the crawling and indexing of certain parts of your site by web crawlers and spiders run by sites like Yahoo! and Google. By telling these "robots" where not to go on your site, you save bandwidth and server resources.
  • This file will be ignored unless it is at the root of your host:
  • Used: http://example.com/robots.txt
  • Ignored: http://example.com/site/robots.txt
  • For more information about the robots.txt standard, see:
  • http://www.robotstxt.org/robotstxt.html
  • Crawl-delay: 10
  • Directories
  • Files
  • Paths
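
The comment about the file being honored only at the root of the host can be illustrated with a small helper that derives the robots.txt location from any page URL. This is a sketch using Python's standard `urllib.parse`. The function name `robots_txt_url` and the example page path are hypothetical.

```python
from urllib.parse import urlsplit, urlunsplit


def robots_txt_url(page_url: str) -> str:
    """Return the root-level robots.txt URL for the host serving page_url."""
    parts = urlsplit(page_url)
    # Keep scheme and host (including any port); drop path, query, and fragment.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


if __name__ == "__main__":
    print(robots_txt_url("https://demo-v3.katadata.co.id/en/some/page?x=1"))
    print(robots_txt_url("http://example.com/site/index.html"))
```

Both calls resolve to the host root, so a file placed at `http://example.com/site/robots.txt` would never be requested by a crawler following this convention.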