wget从kaggle下载数据
step 1:导出cookie到cookie.txt。登录kaggle.com(我用的ie11),点击收藏夹,点击添加到收藏夹右边的三角形,选择导入导出,选择导出到文件,选择cokies,选择导出。
step 2: wget -cb https://www.kaggle.com/account/login?ReturnUrl=%2Fc%2Fnoaa-fisheries-steller-sea-lion-population-count%2Fdownload%2FKaggleNOAASeaLions.7z --post-data 'username=1030997649@qq.com&password=6393374'
但是这种方法下载的很慢,最后下载下来后,解压时还出错了。
于是,我找到了这种方法去解压文件:
https://www.kaggle.com/c/noaa-fisheries-steller-sea-lion-population-count/discussion/30930
答案1:If it helps anyone: the default ubuntu Archive Manager failed to unpack the file for me. Was only showing the list of folders and files in them. I could unpack the archive by installing p7zip-full (sudo apt-get install p7zip-full) and doing 7z x Kaggle-NOAA-SeaLions -pPassword.
我用这种方法,还是没有解决问题。有CRC校验错误。
于是,我决定重新下载数据,找到了一种用aria2c的快速下载方法:
https://www.kaggle.com/c/noaa-fisheries-steller-sea-lion-population-count/discussion/32702
sudo apt install aria2
aria2c -c -x 16 -s 16 --load-cookies cookies.txt -p https://www.kaggle.com/c/noaa-fisheries-steller-sea-lion-population-count/download/KaggleNOAASeaLions.7z
posted on 2017-10-24 23:00 MissSimple 阅读(3036) 评论(0) 编辑 收藏 举报