idrowiki.org
robots.txt

Robots Exclusion Standard data for idrowiki.org

Resource Scan

Scan Details

Site Domain idrowiki.org
Base Domain idrowiki.org
Scan Status Ok
Last Scan2025-11-29T19:26:47+00:00
Next Scan 2025-12-06T19:26:47+00:00

Last Scan

Scanned2025-11-29T19:26:47+00:00
URL https://idrowiki.org/robots.txt
Domain IPs 104.21.52.29, 172.67.194.156, 2606:4700:3031::6815:341d, 2606:4700:3032::ac43:c29c
Response IP 172.67.194.156
Found Yes
Hash 9ba96e49348a8d62d3a5249f48e69117d0f6ff7d5ecf010d5ae2c60cd1a87e10
SimHash e805cf641377

Groups

googlebot-image

Rule Path
Disallow
Allow /*

mediapartners-google*

Rule Path
Disallow

duggmirror

Rule Path
Disallow /

*

Rule Path
Disallow /site.html
Disallow /cgi-bin/
Allow /w/load.php?
Allow /w/images/
Allow /w/resources/assets/
Disallow /w/index.php?
Disallow /w/
Disallow /wiki/Special%3ASearch
Disallow /wiki/Special%3ARandom
Disallow /wiki/Istimewa%3APencarian
Disallow /wiki/Ragnarok
Allow /c/load.php?
Allow /c/images/
Allow /c/resources/assets/
Disallow /c/index.php?
Disallow /klasik/Special%3ASearch
Disallow /klasik/Special%3ARandom
Disallow /klasik/Istimewa%3APencarian
Disallow /klasik/Ragnarok

Other Records

Field Value
sitemap http://idrowiki.org/sitemap.xml
sitemap http://idrowiki.org/c/sitemap/sitemap-index.xml

Comments

  • Google Image
  • Google AdSense
  • digg mirror
  • global
  • Disallow: /resources/