brights.id
robots.txt

Robots Exclusion Standard data for brights.id

Resource Scan

Scan Details

Site Domain brights.id
Base Domain brights.id
Scan Status Ok
Last Scan2026-03-01T06:45:46+00:00
Next Scan 2026-03-31T06:45:46+00:00

Last Scan

Scanned2026-03-01T06:45:46+00:00
URL https://brights.id/robots.txt
Redirect https://www.brights.id/robots.txt
Redirect Domain www.brights.id
Redirect Base brights.id
Domain IPs 45.223.167.24, 45.223.171.24
Redirect IPs 45.223.171.24
Response IP 45.223.171.24
Found Yes
Hash a245c4ee2a38d6c3b7990c6e019f53fc5519af1f77f08f6e4d8128107b98121b
SimHash a8791d7144e5

Groups

*

Rule Path
Allow /

Other Records

Field Value
sitemap https://web-dev-admin.brights.co.id/sitemap.xml

Comments

  • This robots.txt file controls crawling of URLs under https://example.com.
  • All crawlers are disallowed to crawl files in the "includes" directory, such
  • as .css, .js, but Google needs them for rendering, so Googlebot is allowed
  • to crawl them.