studycorgi.com
robots.txt

Robots Exclusion Standard data for studycorgi.com

Resource Scan

Scan Details

Site Domain studycorgi.com
Base Domain studycorgi.com
Scan Status Ok
Last Scan2024-06-03T05:51:04+00:00
Next Scan 2024-07-03T05:51:04+00:00

Last Scan

Scanned2024-06-03T05:51:04+00:00
URL https://studycorgi.com/robots.txt
Domain IPs 104.21.25.165, 172.67.134.99, 2606:4700:3033::6815:19a5, 2606:4700:3035::ac43:8663
Response IP 172.67.134.99
Found Yes
Hash 477b79da80716f80c835e516ad12cc74832f5b8c6fcebd4af1b5c33d35a2c8d5
SimHash 311e113386d0

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /cdn-cgi
Disallow /?
Disallow /wp-
Disallow *?s=
Disallow *%26s%3D
Disallow /search/
Disallow /author/
Disallow /users/
Disallow */trackback
Disallow */feed/
Disallow */rss/
Disallow */wlwmanifest.xml
Disallow /xmlrpc.php
Disallow *utm*%3D
Disallow *openstat%3D
Disallow /privacy/
Disallow /terms/
Disallow *-page3.webp
Disallow *-page4.webp
Disallow *-page5.webp
Disallow /page/*
Allow /page/2/
Allow /page/3/
Allow /page/4/
Allow /page/5/
Allow /page/6/
Allow /page/7/
Allow /page/8/
Allow */uploads
Allow /*/*.js
Allow /*/*.css
Allow /wp-*.png
Allow /wp-*.jpg
Allow /wp-*.jpeg
Allow /wp-*.gif
Allow /wp-*.svg
Allow /wp-admin/admin-ajax.php
Allow /wp-*.webp

turnitinbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://studycorgi.com/sitemap.xml

Comments

  • Sitemap