intraxenglish.com
robots.txt
Robots Exclusion Standard data for intraxenglish.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | intraxenglish.com |
Base Domain | intraxenglish.com |
Scan Status | Ok |
Last Scan | 2024-09-27T04:33:32+00:00 |
Next Scan | 2024-10-04T04:33:32+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-09-27T04:33:32+00:00 |
URL | https://intraxenglish.com/robots.txt |
Domain IPs | 46.17.173.196 |
Response IP | 46.17.173.196 |
Found | Yes |
Hash | a7c14ce2847a7e2346f881b4c0f2421225da6c7d174d46377da351d378253256 |
SimHash | 4a405540ab09 |
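The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. As a minimal sketch under that assumption, a fetched robots.txt body could be compared against a previously recorded hash to detect changes between scans. The function names `sha256_hex` and `has_changed` are hypothetical, and whether the scanner hashes the raw response bytes is also an assumption.

```python
import hashlib


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of a response body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, previous_hash: str) -> bool:
    """Report whether the body no longer matches a previously stored hash.

    Assumes previous_hash is a hex SHA-256 digest, which the scan report
    does not confirm.
    """
    return sha256_hex(body) != previous_hash.strip().lower()
```

A caller would pass the bytes of the latest robots.txt response together with the Hash value recorded by the previous scan.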
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /cgi-bin/ |
Disallow | /wp-admin/ |
Disallow | /disclaimer/ |
Disallow | /privacy-policy/ |
Disallow | /about-us/ |
Disallow | /popular/ |
Disallow | /search |
Disallow | /comments/feed/ |
Disallow | /trackback/ |
Disallow | /index.php |
Disallow | /xmlrpc.php |
User-agent: *
Rule | Path |
---|---|
Allow | /wp-includes/js/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.intraxenglish.com/post-sitemap.xml |
sitemap | https://www.intraxenglish.com/page-sitemap.xml |
sitemap | https://www.intraxenglish.com/category-sitemap.xml |
sitemap | http://cdn.attracta.com/sitemap/6159355.xml.gz |
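To see how the groups and sitemap records above would be applied by a crawler, the following sketch rebuilds a robots.txt from the tables and queries it with Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text is a reconstruction, not the site's actual file: the original ordering, comments, and the 4 invalid lines are unknown. The `build_parser` helper and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the report's tables; the real file may differ.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cgi-bin/
Disallow: /wp-admin/
Disallow: /disclaimer/
Disallow: /privacy-policy/
Disallow: /about-us/
Disallow: /popular/
Disallow: /search
Disallow: /comments/feed/
Disallow: /trackback/
Disallow: /index.php
Disallow: /xmlrpc.php

User-agent: *
Allow: /wp-includes/js/

Sitemap: https://www.intraxenglish.com/post-sitemap.xml
Sitemap: https://www.intraxenglish.com/page-sitemap.xml
Sitemap: https://www.intraxenglish.com/category-sitemap.xml
Sitemap: http://cdn.attracta.com/sitemap/6159355.xml.gz
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text locally, without any network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)
BASE = "https://intraxenglish.com"

# Sample paths chosen for illustration only.
for path in ("/wp-admin/", "/search?q=test", "/wp-includes/js/app.js", "/blog/"):
    verdict = "allowed" if parser.can_fetch("*", BASE + path) else "disallowed"
    print(f"{path}: {verdict}")

# site_maps() requires Python 3.8 or later.
print(parser.site_maps())
```

Paths under `/wp-includes/js/` come out as allowed in either case, because no Disallow rule in the reconstruction covers them. Note that `urllib.robotparser` matches rules by simple prefix and in file order, so other crawlers may resolve edge cases differently.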
Warnings
- 4 invalid lines.