thecitiesportal.com
robots.txt

Robots Exclusion Standard data for thecitiesportal.com

Resource Scan

Scan Details

Site Domain thecitiesportal.com
Base Domain thecitiesportal.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2026-01-28T23:08:16+00:00
Next Scan 2026-04-28T23:08:16+00:00

Last Successful Scan

Scanned 2025-03-11T23:33:14+00:00
URL https://thecitiesportal.com/robots.txt
Domain IPs 104.21.20.80, 172.67.191.251, 2606:4700:3032::6815:1450, 2606:4700:3035::ac43:bffb
Response IP 172.67.191.251
Found Yes
Hash 8718515c23f03b8550a8280723092c40ca9bbbe38afdd0ff5651090afafae731
SimHash 2138b9638eba

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /cgi-bin
Disallow /?
Disallow /wp-
Disallow /wp/
Disallow *?s=
Disallow *%26s%3D
Disallow /search/
Disallow /author/
Disallow /users/
Disallow */trackback
Disallow */feed
Disallow */rss
Disallow */embed
Disallow /xmlrpc.php
Disallow *utm*%3D
Disallow *openstat%3D
Allow */uploads
Allow /*/*.js
Allow /*/*.css
Allow /wp-*.png
Allow /wp-*.jpg
Allow /wp-*.jpeg
Allow /wp-*.gif
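
Several of the rules above overlap (for example, Disallow /wp- and Allow */uploads both match paths under /wp-content/uploads), so the outcome for a given URL depends on how a crawler resolves conflicts. The scan record does not say which matching semantics apply. The sketch below assumes the longest-match precedence described in RFC 9309, where the longest matching pattern wins and Allow wins a tie. The names RULES, pattern_to_regex, and is_allowed are hypothetical helpers introduced for illustration, and percent-encoding normalization is deliberately left out.

```python
import re

# Rules transcribed from the "*" group above, in listed order.
RULES = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "/cgi-bin"),
    ("disallow", "/?"),
    ("disallow", "/wp-"),
    ("disallow", "/wp/"),
    ("disallow", "*?s="),
    ("disallow", "*%26s%3D"),
    ("disallow", "/search/"),
    ("disallow", "/author/"),
    ("disallow", "/users/"),
    ("disallow", "*/trackback"),
    ("disallow", "*/feed"),
    ("disallow", "*/rss"),
    ("disallow", "*/embed"),
    ("disallow", "/xmlrpc.php"),
    ("disallow", "*utm*%3D"),
    ("disallow", "*openstat%3D"),
    ("allow", "*/uploads"),
    ("allow", "/*/*.js"),
    ("allow", "/*/*.css"),
    ("allow", "/wp-*.png"),
    ("allow", "/wp-*.jpg"),
    ("allow", "/wp-*.jpeg"),
    ("allow", "/wp-*.gif"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Without '$', the pattern is a prefix match from the start of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under the assumed semantics.

    Assumption: longest matching pattern wins, Allow wins ties,
    and a path that matches no rule is allowed.
    """
    best_len, verdict = -1, True
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt requires.
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, verdict = len(pattern), allow
    return verdict
```

Under these assumptions, is_allowed("/wp-admin/admin-ajax.php") returns True because the 24-character Allow pattern outweighs the 10-character Disallow /wp-admin/, while is_allowed("/wp-admin/") returns False. Crawlers that apply first-match ordering instead would reach different results for some paths.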

Other Records

Field Value
sitemap https://thecitiesportal.com/sitemap.xml