groupc.com
robots.txt
Robots Exclusion Standard data for groupc.com
Resource Scan
Scan Details
Site Domain | groupc.com |
Base Domain | groupc.com |
Scan Status | Ok |
Last Scan | 2025-06-16T08:05:47+00:00 |
Next Scan | 2025-06-23T08:05:47+00:00 |
Last Scan
Scanned | 2025-06-16T08:05:47+00:00 |
URL | https://groupc.com/robots.txt |
Redirect | https://groupcmedia.com/robots.txt |
Redirect Domain | groupcmedia.com |
Redirect Base | groupcmedia.com |
Domain IPs | 104.21.32.182, 172.67.153.172, 2606:4700:3032::ac43:99ac, 2606:4700:3037::6815:20b6 |
Redirect IPs | 162.159.135.42 |
Response IP | 162.159.135.42 |
Found | Yes |
Hash | 3e1ceb757d77a3d144963020d4bbb71e6d5566536e17dfde5ca9dd8a1fd0139d |
SimHash | 77be12c3e7e1 |
Groups
*
Rule | Path | Comment |
---|---|---|
Disallow | /wp-admin/ | block access to admin section |
Disallow | /wp-login.php | block access to admin section |
Disallow | /search/ | block access to internal search result pages |
Disallow | *?s=* | block access to internal search result pages |
Disallow | *?p=* | block access to pages for which permalinks fails |
Disallow | *%26p%3D* | block access to pages for which permalinks fails |
Disallow | *%26preview%3D* | block access to preview pages |
Disallow | /tag/ | block access to tag pages |
Disallow | /author/ | block access to author pages |
Disallow | /404-error/ | block access to 404 page |
Disallow | *.xlsx$ | - |
Disallow | *.xls$ | - |
Disallow | *.pdf$ | - |
Disallow | *.doc$ | - |
Disallow | *.docx$ | - |
*
Rule | Path |
---|---|
Disallow |
Other Records
Field | Value |
---|---|
crawl-delay | 10 |
Other Records
Field | Value |
---|---|
sitemap | https://groupcmedia.com/sitemap_index.xml |
sitemap | https://groupcmedia.com/post-sitemap.xml |
sitemap | https://groupcmedia.com/page-sitemap.xml |
sitemap | https://groupcmedia.com/category-sitemap.xml |
sitemap | https://groupcmedia.com/news-sitemap.xml |
sitemap | https://groupcmedia.com/editors-pick.rss |
Warnings
- 2 invalid lines.
Comments