theirregularcorp.com
robots.txt

Robots Exclusion Standard data for theirregularcorp.com

Resource Scan

Scan Details

Site Domain theirregularcorp.com
Base Domain theirregularcorp.com
Scan Status Ok
Last Scan 2024-08-31T08:02:42+00:00
Next Scan 2024-09-30T08:02:42+00:00

Last Scan

Scanned 2024-08-31T08:02:42+00:00
URL https://theirregularcorp.com/robots.txt
Domain IPs 104.18.26.25, 104.18.27.25, 2606:4700::6812:1a19, 2606:4700::6812:1b19
Response IP 104.18.27.25
Found Yes
Hash 489f01b530c7e23df17b53c5d29614412658dd4d51a9ccb596388a5b497607dc
SimHash 41000c408db3

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
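
The two rules in the * group overlap, so the result for a given path depends on which rule takes precedence. The sketch below shows one way to evaluate them, assuming the longest-match precedence described in RFC 9309 (the most specific matching rule wins, and Allow wins a tie). The names RULES and is_allowed are hypothetical and do not come from the scan; the matching is plain prefix comparison and does not handle the * or $ wildcards or percent-encoding.

```python
# Rules copied from the "*" group of this report.
# RULES and is_allowed are illustrative names, not part of the scan data.
RULES = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence.

    Assumption: simple prefix matching only (no "*" or "$" wildcards).
    """
    best_len = -1
    allowed = True  # a path that matches no rule is allowed by default
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        # The longer prefix wins; on equal length, prefer the allow rule.
        if len(prefix) > best_len or (len(prefix) == best_len and kind == "allow"):
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for p in ("/", "/wp-admin/", "/wp-admin/options.php", "/wp-admin/admin-ajax.php"):
        print(p, "allowed" if is_allowed(p) else "disallowed")
```

Under these assumptions, everything below /wp-admin/ is disallowed except /wp-admin/admin-ajax.php, because the Allow path is the longer match. Parsers that apply the first matching rule in file order rather than the longest match may treat that path as disallowed.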

Other Records

Field Value
sitemap https://theirregularcorporation.com/sitemap.xml