<?xml version="1.0" encoding="Windows-1252"?>
<node id="11159142" title="Re^4: Is there a simple way to archive/download all of PerlMonks?" created="2024-04-29 05:36:30" updated="2024-04-29 05:36:30">
<type id="11">
note</type>
<author id="324763">
marto</author>
<data>
<field name="doctext">
&lt;p&gt;I didn't check all of the domains, no. I think the only valid archive would be an up to date database extract of node content (and some of the other metadata), rather partial snapshots of page impressions from a moment in time.&lt;/p&gt;
&lt;p&gt;&lt;b&gt;Update:&lt;/b&gt; I seem to recall different domains having different robots.txt rules to impact indexing.&lt;/p&gt;</field>
<field name="root_node">
11159096</field>
<field name="parent_node">
11159141</field>
</data>
</node>
