newsroompost.com
robots.txt

Robots Exclusion Standard data for newsroompost.com

Resource Scan

Scan Details

Site Domain newsroompost.com
Base Domain newsroompost.com
Scan Status Ok
Last Scan 2024-05-17T15:49:34+00:00
Next Scan 2024-05-24T15:49:34+00:00

Last Scan

Scanned 2024-05-17T15:49:34+00:00
URL https://newsroompost.com/robots.txt
Domain IPs 104.26.14.119, 104.26.15.119, 172.67.68.21, 2606:4700:20::681a:e77, 2606:4700:20::681a:f77, 2606:4700:20::ac43:4415
Response IP 172.67.68.21
Found Yes
Hash 7137766b4412065b843621fef6643c91c1d8df2e193fe719d26c404b87c65a19
SimHash 2d4d7c488b53

Groups

*

Rule Path
Allow /
Disallow */page/*
Disallow */attachment/*
Disallow /wp-admin/
Disallow */cdn-cgi/*
Disallow /favicon.ico
Disallow *.html/1*
Disallow *.html/2*
Disallow *?redirect*
Disallow *?s*
Disallow */h/g/cv/*
Disallow *?p*
Disallow *?s*
Disallow *?_gl*
Disallow */tag/*
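
To illustrate how the wildcard rules in this group might be evaluated, here is a minimal Python sketch. It assumes the common convention that `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical and not part of the scan data, and individual crawlers may treat patterns that begin with `*` differently.

```python
import re

# Rules copied from the "*" group above (the repeated "*?s*" entry is listed once).
RULES = [
    ("allow", "/"),
    ("disallow", "*/page/*"),
    ("disallow", "*/attachment/*"),
    ("disallow", "/wp-admin/"),
    ("disallow", "*/cdn-cgi/*"),
    ("disallow", "/favicon.ico"),
    ("disallow", "*.html/1*"),
    ("disallow", "*.html/2*"),
    ("disallow", "*?redirect*"),
    ("disallow", "*?s*"),
    ("disallow", "*/h/g/cv/*"),
    ("disallow", "*?p*"),
    ("disallow", "*?_gl*"),
    ("disallow", "*/tag/*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    # A trailing '$' anchors the match at the end of the path.
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Each '*' becomes '.*'; everything else is matched literally.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the path (including any query string) may be crawled."""
    best = None  # (pattern length, is_allow); tuple order makes Allow win ties
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # With no matching rule, crawling is permitted by default.
    return True if best is None else best[1]
```

Under these assumptions, `is_allowed("/tag/politics/")` and `is_allowed("/?s=election")` return `False`, while an ordinary article path such as `/india/some-story.html` is matched only by `Allow /` and returns `True`. Note that `*?p*` blocks any URL whose query string begins with `p`, not only `?p=` permalinks.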

Other Records

Field Value
sitemap https://newsroompost.com/sitemap.xml