Feared wrote: I've backed up most of the user sites (qexpo is big, though, so I haven't had a chance). With the link above you are able to download a tar dump of the current directory you're viewing.
The qexpo stuff really isn't that huge: 934 MB (about 1/4 of everything). I'm not sure whether you're hitting this exact issue or some other one, but qexpo2001.quakedev.com, in particular, can leave the impression that the mirror just won't finish.
I've noticed that wget can get stuck downloading booth.php because of the way the page handles the variable n, and wget, unfortunately, doesn't have an adequate way of mirroring this kind of behavior.
The -A and -R switches only take effect after wget has made the request, so they can't keep it from fetching the page in the first place.
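Since wget can't be told to skip the request up front, one workaround is to enumerate the booth pages yourself. A minimal sketch, assuming the booths are numbered 1 through 70 (the bound the script in this post loops to); the function name booth_urls is made up for illustration:

```shell
# Print one booth URL per line for booth numbers 1..max.
# The body runs in a subshell so max and i don't leak into the caller.
booth_urls() (
    max=${1:-70}   # assumption: 70 is the highest booth number
    i=1
    while [ "$i" -le "$max" ]; do
        printf 'http://qexpo2001.quakedev.com/booths/booth.php?n=%d\n' "$i"
        i=$((i + 1))
    done
)
```

Something like `booth_urls 70 > booth-urls.txt` followed by `wget -i booth-urls.txt` (plus whatever other switches you want) should then fetch each booth by its explicit URL instead of relying on the crawl to find them.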
If anyone is having trouble with this, you could try this script (at your own peril), which should grab everything and fix the links that didn't get converted.
I don't know if this code box will work well with this... but it may save some people a bit of work.
Also, there are certainly more efficient/elegant ways to do this, but it'll give you an idea of what the problem is.
If it ends up that someone hosts the original PHP code on their web server, then it would be wise to fix this there.
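As one example of a more compact way to do the per-booth rewrite: the sketch below handles every booth number in a single pass with a capture group instead of looping from 1 to 70. It uses sed rather than perl, the function name rewrite_booth_links and its prefix argument are made up for illustration, and it assumes the links appear as absolute URLs immediately followed by a closing double quote.

```shell
# Rewrite absolute booth links on stdin to local files, writing to stdout.
# $1 is the relative path to the booths directory (e.g. ./booths or .).
# The trailing double quote in the pattern anchors the end of the URL,
# and ([0-9]+) carries the booth number over into the replacement.
rewrite_booth_links() (
    prefix=${1:-./booths}
    sed -E "s#http://qexpo2001\.quakedev\.com/booths/booth\.php\?n=([0-9]+)\"#${prefix}/booth.php%3Fn=\1.html\"#g"
)
```

For example, `rewrite_booth_links ./booths < index.html > index.fixed.html` would cover the root directory, and `rewrite_booth_links .` the booths directory itself. The `%3F` is the percent-encoded `?` in the saved file name.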
-BEWARE- The code tag appears to add whitespace after the trailing \ on continued lines. You may need to remove this whitespace, or join each continued command onto a single line.
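If you'd rather not clean that up by hand, a small filter can strip the stray whitespace. This is only a sketch: the function name strip_continuation_ws is made up, and it assumes the only damage is spaces, tabs or carriage returns after a backslash at the end of a line.

```shell
# Remove whitespace that follows a trailing backslash, so the backslash
# is the last character on the line again and continues the command.
# Reads the named files, or stdin if none are given; writes to stdout.
strip_continuation_ws() {
    sed -E 's/\\[[:space:]]+$/\\/' "$@"
}
```

Usage would be along the lines of `strip_continuation_ws pasted.sh > mirror.sh`, where both file names are placeholders.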
- Code:
#!/usr/bin/env bash
PERL=$(which perl)
WGET=$(which wget)
echo -e "Mirroring qexpo2001.quakedev.com...";
sleep 2
$WGET -m -k -K -N -rH -R"booth.php" \
-Dqexpo2001.quakedev.com,quakedev.com,ajaysquakesite.co.uk,nosferatuthegame.com \
-E -T 30 -t 1 -Xforums,wiki,phpBB3 --exclude-domains=board.nosferatuthegame.com,facelift.quakedev.com \
http://qexpo2001.quakedev.com/ ;
echo -n "Done.";
echo -e "\nDownloading booths...";
sleep 2
for (( i=1;i<=70;i+=1 ));
do
$WGET -m -k -K -N -rH -R"booth.php" \
-Dqexpo2001.quakedev.com,quakedev.com,ajaysquakesite.co.uk,nosferatuthegame.com \
-E -T 30 -t 1 -Xforums,wiki,phpBB3 --exclude-domains=board.nosferatuthegame.com,facelift.quakedev.com \
"http://qexpo2001.quakedev.com/booths/booth.php?n=${i}" ;
done
echo -n "Done.";
echo -e "\nFixing links in root dir...";
sleep 2
cd ./qexpo2001.quakedev.com || exit 1 ;
for (( i=1;i<=70;i+=1 ))
do
# The escaped closing quote (\") anchors the end of the URL so n=1 does not also match n=10.
$PERL -pi -e "s/http:\/\/qexpo2001.quakedev.com\/booths\/booth.php\?n\=${i}\"/.\/booths\/booth.php\%3Fn\=${i}.html\"/g;" *.html ;
done
$PERL -pi -e "s/http:\/\/qexpo2001.quakedev.com\/booths\/createbooth.php\"/.\/booths\/createbooth.php.html\"/g;" *.html ;
$PERL -pi -e "s/http:\/\/forums.inside3d.com\/wwwthreads.pl\?action\=list\&Board\=QExpo/http:\/\/forums.inside3d.com\//g;" *.html ;
echo -n "Done.";
echo -e "\nFixing links in booths dir...";
sleep 2
cd ./booths || exit 1 ;
cp login.php.html createbooth.php.html ;
for (( i=1;i<=70;i+=1 ))
do
# The escaped closing quote (\") anchors the end of the URL so n=1 does not also match n=10.
find . -name '*.html' -type f -exec $PERL -pi -e "s/http:\/\/qexpo2001.quakedev.com\/booths\/booth.php\?n\=${i}\"/.\/booth.php\%3Fn\=${i}.html\"/g;" {} \; \
-exec $PERL -pi -e "s/http:\/\/qexpo2001.quakedev.com\/booths\/upload\/${i}/.\/upload\/${i}/g;" {} \;
done
$PERL -pi -e "s/http:\/\/qexpo2001.quakedev.com\/booths\/createbooth.php\"/.\/createbooth.php.html\"/g;" *.html ;
$PERL -pi -e "s/http:\/\/forums.inside3d.com\/wwwthreads.pl\?action\=list\&Board\=QExpo/http:\/\/forums.inside3d.com\//g;" *.html ;
echo -n "Done.";
echo -e "\nFixing links in events dir...";
sleep 2
cd ../events || exit 1 ;
find . -name '*.html' -type f -exec $PERL -pi -e "s/http:\/\/qexpo2001.quakedev.com\/booths\/createbooth.php\"/..\/booths\/createbooth.php.html\"/g;" {} \; \
-exec $PERL -pi -e "s/http:\/\/qexpo2001.quakedev.com\/booths\/login.php\"/..\/booths\/login.php\"/g;" {} \;
$PERL -pi -e "s/http:\/\/forums.inside3d.com\/wwwthreads.pl\?action\=list\&Board\=QExpo/http:\/\/forums.inside3d.com\//g;" *.html ;
echo -n "Done.";
echo -e "\nFixing links in features dir...\n";
sleep 2
cd ../features || exit 1 ;
$PERL -pi -e "s/http:\/\/qexpo2001.quakedev.com\/booths\/createbooth.php\"/..\/booths\/createbooth.php.html\"/g;" *.html ;
$PERL -pi -e "s/http:\/\/forums.inside3d.com\/wwwthreads.pl\?action\=list\&Board\=QExpo/http:\/\/forums.inside3d.com\//g;" *.html ;
echo -e "All done.\n";
Code edit: any link that still leads back to qexpo2001.quakedev.com should only be there because it was a dead link anyway, and the forum link now points to forums.inside3d.com instead of the broken URL.
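To check that claim on your own copy, something like the following could list whatever absolute links are left after the script has run. It's a rough sketch: the function name remaining_absolute_links is made up, and it assumes a URL ends at the first double quote, space or `>`.

```shell
# Count the absolute links to the live site that remain in the mirrored
# .html files under a directory (default: current directory).
# Output is "count URL" per distinct URL, most frequent first.
remaining_absolute_links() (
    dir=${1:-.}
    find "$dir" -type f -name '*.html' \
        -exec grep -ohE 'http://qexpo2001\.quakedev\.com[^" >]*' {} + \
        | sort | uniq -c | sort -rn
)
```

Run it as `remaining_absolute_links ./qexpo2001.quakedev.com` from the directory you mirrored into. Anything it prints is a link the rewrite didn't touch.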