techmediabooks.com
robots.txt

Robots Exclusion Standard data for techmediabooks.com

Resource Scan

Scan Details

Site Domain techmediabooks.com
Base Domain techmediabooks.com
Scan Status Ok
Last Scan2025-10-10T14:22:34+00:00
Next Scan 2025-11-09T14:22:34+00:00

Last Scan

Scanned2025-10-10T14:22:34+00:00
URL https://techmediabooks.com/robots.txt
Domain IPs 162.144.3.43
Response IP 162.144.3.43
Found Yes
Hash c723ca22e585af2ba9a19bf32f9b06f9b89238b5319ce3c2274f87485f6bac98
SimHash 6308d852c2b1

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/plugins/
Disallow /wp-content/themes/
Disallow /trackback/
Disallow /feed/
Disallow */trackback/
Disallow */feed/
Disallow */comments/
Disallow /*?
Disallow /*.php$
Disallow /*?vn=*

mj12bot

Rule Path
Disallow

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

ia_archiver

Rule Path
Disallow /

boomtrain-content-bot*

Rule Path
Disallow
Allow /*

Other Records

Field Value
sitemap https://www.techmediabooks.com/post-sitemap.xml