newswise.com
robots.txt

Robots Exclusion Standard data for newswise.com

Resource Scan

Scan Details

Site Domain newswise.com
Base Domain newswise.com
Scan Status Ok
Last Scan2024-09-25T14:18:49+00:00
Next Scan 2024-10-25T14:18:49+00:00

Last Scan

Scanned2024-09-25T14:18:49+00:00
URL https://newswise.com/robots.txt
Redirect https://www.newswise.com/robots.txt
Redirect Domain www.newswise.com
Redirect Base newswise.com
Domain IPs 172.66.40.240, 172.66.43.16, 2606:4700:3108::ac42:28f0, 2606:4700:3108::ac42:2b10
Redirect IPs 172.66.40.240, 172.66.43.16, 2606:4700:3108::ac42:28f0, 2606:4700:3108::ac42:2b10
Response IP 172.66.40.240
Found Yes
Hash 390b78625120053882a41e127c6be1e9333a774a5f8ed2770c18613ea14e368e
SimHash 450cf947ee13

Groups

*

Rule Path
Allow /users/expert/
Allow /users/expert-list/

Other Records

Field Value
crawl-delay 1

*

Rule Path
Disallow /search/
Disallow /cdn-cgi/l/email-protection
Disallow /nz/feed_widget/
Disallow /nz/search_google
Disallow /reports/
Disallow /legacy/
Disallow /javascripts/
Disallow /stylesheets/
Disallow /images/uploads/
Disallow /archive/get_wire
Disallow /nz/clips_view/
Disallow /archive/list_wires
Disallow /multimedia/
Disallow /special-channel/tabarticles/
Disallow /Users/rotate-image

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

baiduspider

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 15

semrushbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

sputnikbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

majestic

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

baiduspider-video

Rule Path
Disallow /

baiduspider-image

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.newswise.com/sitemap.xml
sitemap https://www.newswise.com/secured_sitemap_news.xml

Comments

  • sitemap for https://www.newswise.com/
  • bots->crawl-delays
  • bots->denied & disallowed