xoilacsg.cc
robots.txt

Robots Exclusion Standard data for xoilacsg.cc

Resource Scan

Scan Details

Site Domain xoilacsg.cc
Base Domain xoilacsg.cc
Scan Status Ok
Last Scan2025-10-07T09:22:05+00:00
Next Scan 2025-11-06T09:22:05+00:00

Last Scan

Scanned2025-10-07T09:22:05+00:00
URL https://xoilacsg.cc/robots.txt
Redirect https://ineer.org/robots.txt
Redirect Domain ineer.org
Redirect Base ineer.org
Domain IPs 104.26.12.21, 104.26.13.21, 172.67.68.132, 2606:4700:20::681a:c15, 2606:4700:20::681a:d15, 2606:4700:20::ac43:4484
Redirect IPs 104.26.4.67, 104.26.5.67, 172.67.69.200, 2606:4700:20::681a:443, 2606:4700:20::681a:543, 2606:4700:20::ac43:45c8
Response IP 104.26.5.67
Found Yes
Hash 29374b055e6cb94561d26d06c725e3c08260dbe5b1cf40d26e5f51d95295f2e6
SimHash 6b58d80449b2

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /author/
Disallow /*/trackback
Disallow /tag/
Disallow /*/feed
Disallow /?s=*
Disallow /attachment/
Disallow /*?utm_source
Disallow /*%26utm_source

facebookexternalhit

Rule Path
Allow /

Other Records

Field Value
sitemap https://ineer.org/sitemap.xml

Comments

  • Allow Facebook scraper