padangbulan.co.id
robots.txt

Robots Exclusion Standard data for padangbulan.co.id

Resource Scan

Scan Details

Site Domain padangbulan.co.id
Base Domain padangbulan.co.id
Scan Status Ok
Last Scan2024-09-20T21:51:25+00:00
Next Scan 2024-09-27T21:51:25+00:00

Last Scan

Scanned2024-09-20T21:51:25+00:00
URL https://padangbulan.co.id/robots.txt
Domain IPs 66.29.153.238
Response IP 66.29.153.238
Found Yes
Hash 2a5ce07d19c5b4860efb95b7ade68fb64a0f81e96fee0089e43990c24b64c659
SimHash 6d49bdd127d1

Groups

mediapartners-google

Rule Path
Disallow

*

Rule Path
Disallow /language/*
Disallow /src/
Disallow /?page=*
Disallow /*?page=*
Disallow /*?page=*&s=*
Disallow /*?page=*&feed=*
Disallow /*/*?page=*

Other Records

Field Value
sitemap https://padangbulan.co.id/sitemap.xml
sitemap https://padangbulan.co.id/sitemap-news.xml
sitemap https://padangbulan.co.id/rss.xml

Comments

  • Hello search engine
  • I am robots.txt
  • I am very happy to be crawled