thecreatology.com
robots.txt

Robots Exclusion Standard data for thecreatology.com

Resource Scan

Scan Details

Site Domain thecreatology.com
Base Domain thecreatology.com
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2025-11-22T00:39:48+00:00
Next Scan 2026-01-21T00:39:48+00:00

Last Successful Scan

Scanned2025-09-23T11:09:08+00:00
URL https://thecreatology.com/robots.txt
Domain IPs 195.35.44.168
Response IP 195.35.44.168
Found Yes
Hash 3dccbac37e74f451822451612a3c2eebe72aa4228d5aba94bba324be08b51a72
SimHash 6f10c91efaab

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/
Disallow /go/
Disallow /category/
Disallow /tag/
Disallow /archives/
Disallow /*?*
Allow /*?c=*
Allow /*?t=js
Disallow *?replytocom
Disallow /page/
Disallow /author/
Disallow /comments/feed/
Disallow /trackback/
Disallow /index.php
Disallow /feed/
Disallow /xmlrpc.php
Disallow *?wptheme
Disallow *?nomobile
Disallow /search?
Disallow /?p=*
Disallow /work/feed
Disallow /terms-of-service/tos
Disallow /search/feed/*
Disallow /blog/page/*
Allow /wp-content/themes/sq_120/*

mediapartners-google*

Rule Path
Allow /

adsbot-google-mobile

Rule Path
Allow /

googlebot-image

Rule Path
Allow /wp-content/uploads/

adsbot-google

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

apis-google

Rule Path
Allow /

Other Records

Field Value
sitemap http://www.thecreatology.com/sitemap.xml

Warnings

  • 1 invalid line.