webmagic.com
robots.txt

Robots Exclusion Standard data for webmagic.com

Resource Scan

Scan Details

Site Domain webmagic.com
Base Domain webmagic.com
Scan Status Ok
Last Scan2025-05-16T12:22:11+00:00
Next Scan 2025-05-30T12:22:11+00:00

Last Scan

Scanned2025-05-16T12:22:11+00:00
URL https://webmagic.com/robots.txt
Redirect https://www.webmagic.com/robots.txt
Redirect Domain www.webmagic.com
Redirect Base webmagic.com
Domain IPs 198.55.101.11
Redirect IPs 198.55.101.11
Response IP 198.55.101.11
Found Yes
Hash d11b0bb411aa3fb714e7bba16287843acef9b2436d29053bc82b382ded4e7d00
SimHash 690188400fb3

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
sitemap https://www.webmagic.com/sitemap_index.xml