断点续传的原理(转)

断点续传的原理(转)[@more@]zt from http://www.chinajavaworld.com

(一)断点续传的原理

其实断点续传的原理很简单，就是在Http的请求上和一般的下载有所不同而已。

打个比方，浏览器请求服务器上的一个文时，所发出的请求如下：

假设服务器域名为www.sjtu.edu.cn，文件名为down.zip。

GET /down.zip HTTP/1.1

Accept:image/gif, image/x-xbitmap, image/jpeg, image/pjpeg, application/vnd.ms-excel, application/msword, application/vnd.ms-powerpoint, */*

Accept-Language: zh-cn

Accept-Encoding: gzip, deflate

User-Agent: Mozilla/4.0 (compatible; MSIE 5.01; Windows NT 5.0)

Connection: Keep-Alive

服务器收到请求后，按要求寻找请求的文件，提取文件的信息，然后返回给浏览器，返回信息如下：

200

Content-Length=106786028

Accept-Ranges=bytes

Date=Mon, 30 Apr 2001 12:56:11 GMT

ETag=W/"02ca57e173c11:95b"

Content-Type=application/octet-stream

Server=Microsoft-IIS/5.0

Last-Modified=Mon, 30 Apr 2001 12:56:11 GMT　

所谓断点续传，也就是要从文件已经下载的地方开始继续下载。所以在客户端浏览器传给Web服务器的时候要多加一条信息--从哪里开始。

下面是用自己编的一个"浏览器"来传递请求信息给Web服务器，要求从2000070字节开始。

GET /down.zip HTTP/1.0

User-Agent: NetFox

RANGE: bytes=2000070-

Accept: text/html, image/gif, image/jpeg, *; q=.2, */*; q=.2

仔细看一下就会发现多了一行RANGE: bytes=2000070-

这一行的意思就是告诉服务器down.zip这个文件从2000070字节开始传，前面的字节不用传了。服务器收到这个请求以后，返回的信息如下：

206

Content-Length=106786028

Content-Range=bytes 2000070-106786027/106786028

Date=Mon, 30 Apr 2001 12:55:20 GMT

ETag=W/"02ca57e173c11:95b"

Content-Type=application/octet-stream

Server=Microsoft-IIS/5.0

Last-Modified=Mon, 30 Apr 2001 12:55:20 GMT

和前面服务器返回的信息比较一下，就会发现增加了一行：

Content-Range=bytes 2000070-106786027/106786028

返回的代码也改为206了，而不再是200了。

知道了以上原理，就可以进行断点续传的编程了。

(二)Java实现断点续传的关键几点

(1)用什么方法实现提交RANGE: bytes=2000070-。

当然用最原始的Socket是肯定能完成的，不过那样太费事了，其实Java的net包中提供了这种功能。代码如下：

		URL url = new URL("http://www.sjtu.edu.cn/down.zip");
		HttpURLConnection httpConnection = (HttpURLConnection)url.openConnection　();//设置User-Agent
		httpConnection.setRequestProperty("User-Agent","NetFox");//设置断点续传的开始位置
		httpConnection.setRequestProperty("RANGE","bytes=2000070");//获得输入流
		InputStream input =httpConnection.getInputStream();

从输入流中取出的字节流就是down.zip文件从2000070开始的字节流。

大家看，其实断点续传用Java实现起来还是很简单的吧。

接下来要做的事就是怎么保存获得的流到文件中去了。

保存文件采用的方法。

我采用的是IO包中的RandAccessFile类。

操作相当简单，假设从2000070处开始保存文件，代码如下：

		RandomAccess oSavedFile = new RandomAccessFile("down.zip","rw");
		long nPos = 2000070;//定位文件指针到nPos位置
		oSavedFile.seek(nPos);
		byte[] b = new byte[1024];
		int nRead;//从输入流中读入字节流，然后写到文件中
		while((nRead=input.read(b,0,1024)) > 0){
		      oSavedFile.write(b,0,nRead);
		}

怎么样，也很简单吧。

接下来要做的就是整合成一个完整的程序了。包括一系列的线程控制等等。

(三)断点续传内核的实现主要用了6个类，包括一个测试类。

SiteFileFetch.java负责整个文件的抓取，控制内部线程(FileSplitterFetch类)。

FileSplitterFetch.java负责部分文件的抓取。

FileAccess.java负责文件的存储。

SiteInfoBean.java要抓取的文件的信息，如文件保存的目录，名字，抓取文件的URL等。

Utility.java工具类，放一些简单的方法。

TestMethod.java测试类。

下面例子的思路就是在下载的时候把一些即时信息,如已下载的文件的长度等信息存放到临时文件中,下载连接时首先判断临时文件是否存在,不存在就重新下,存在就读取临时文件中的以下载长度,然后从这个点再开始下。

下面是源程序：

SiteFileFetch.java类文件：

package nc.pub.ump.download;

import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.File;
import java.io.FileInputStream;
import java.io.FileOutputStream;
import java.io.IOException;
import java.net.HttpURLConnection;
import java.net.URL;

/**
 * 传输文件线程类。
 * @author whs
 * @version 1.0
 */
public class SiteFileFetch extends Thread {
	SiteInfoBean siteInfoBean = null;
	long[] nPos;		//	文件位置指针 
	long[] nStartPos;	//	开始位置 
	long[] nEndPos;		//	结束位置 
	FileSplitterFetch[] fileSplitterFetch;	//	子线程对象
	long nFileLength;	//	文件长度 
	boolean bFirst = true;	//	是否第一次读取 
	boolean bStop = false;	// 	停止标志 
	File tmpFile; 			//	文件传输临时信息 
	DataOutputStream output;//	输出到文件的输出流 
	public SiteFileFetch(SiteInfoBean bean) throws IOException {
		siteInfoBean = bean;
		tmpFile = new File(bean.getSFilePath() + File.separator + bean.getSFileName() + ".info");
		if (tmpFile.exists()) {
			bFirst = false;
			read_nPos();
		} else {
			nStartPos = new long[bean.getNSplitter()];
			nEndPos = new long[bean.getNSplitter()];
		}
	}

	public void run() {
		try {
			if (bFirst) {
				// 获得文件长度
				nFileLength = getFileSize();
				if (nFileLength == -1) {
					System.err.println("File Length is not known");
				} else if (nFileLength == -2) {
					System.err.println("File is not access!");
				} else {
					// 分割下载文件
					for (int i = 0; i < nStartPos.length; i++) {
						nStartPos[i] = (long) (i * (nFileLength / nStartPos.length));
					}
					for (int i = 0; i < nEndPos.length - 1; i++) {
						nEndPos[i] = nStartPos[i + 1];
					}
					nEndPos[nEndPos.length - 1] = nFileLength;
				}
			}
			// 创建FileSplitterFetch类实例
			fileSplitterFetch = new FileSplitterFetch[nStartPos.length];
			// 启动FileSplitterFetch线程
			for (int i = 0; i < nStartPos.length; i++) {
				fileSplitterFetch[i] = new FileSplitterFetch(siteInfoBean.getSSiteURL(),
						siteInfoBean.getSFilePath() + File.separator + siteInfoBean.getSFileName(),
						nStartPos[i], nEndPos[i], i);
				Utility.log("Thread " + i + ",nStartPos=" + nStartPos[i] + " ,nEndPos= " + nEndPos[i]);
				fileSplitterFetch[i].start();
			}
			boolean breakWhile = false;
			// 等待子线程结束
			while (!bStop) {
				write_nPos();
				Utility.sleep(500);
				breakWhile = true;
				for (int i = 0; i < nStartPos.length; i++) {
					// 等待子线程返回
					if (!fileSplitterFetch[i].bDownOver) {
						breakWhile = false;
						break;
					}
				}
				// 是否结束while循环
				if (breakWhile)
					break;
			}
			System.out.println("文件传输结束!");
		} catch (Exception e) {
			e.printStackTrace();
		}
	}

	/**
	 * 获得文件长度
	 * @return
	 */
	public long getFileSize() {
		int nFileLength = -1;
		try {
			// 创建与WEB服务器的连接
			URL url = new URL(siteInfoBean.getSSiteURL());
			HttpURLConnection httpConnection = (HttpURLConnection) url.openConnection();
			httpConnection.setRequestProperty("User-Agent", "sample.resumebrokentransfer");
			int responseCode = httpConnection.getResponseCode();
			if (responseCode >= 400) {
				processErrorCode(responseCode);
				// -2为WEB服务器响应错误
				return -2;
			}

			String sHeader;
			for (int i = 1;; i++) {
				sHeader = httpConnection.getHeaderFieldKey(i);
				if (sHeader != null) {
					if (sHeader.equals("Content-Length")) {
						nFileLength = Integer.parseInt(httpConnection.getHeaderField(sHeader));
						break;
					}
				} else {
					break;
				}
			}
		} catch (IOException e) {
			e.printStackTrace();
		} catch (Exception e) {
			e.printStackTrace();
		}
		Utility.log(nFileLength);
		return nFileLength;
	}

	/**
	 * 保存传输文件指针位置
	 */
	private void write_nPos() {
		try {
			output = new DataOutputStream(new FileOutputStream(tmpFile));
			output.writeInt(nStartPos.length);
			for (int i = 0; i < nStartPos.length; i++) {
				output.writeLong(fileSplitterFetch[i].nStartPos);
				output.writeLong(fileSplitterFetch[i].nEndPos);
			}
			output.close();
		} catch (IOException e) {
			e.printStackTrace();
		} catch (Exception e) {
			e.printStackTrace();
		}
	}

	/**
	 * 读取保存的下载文件指针位置
	 */
	private void read_nPos() {
		try {
			DataInputStream input = new DataInputStream(new FileInputStream(tmpFile));
			int nCount = input.readInt();
			nStartPos = new long[nCount];
			nEndPos = new long[nCount];
			for (int i = 0; i < nStartPos.length; i++) {
				nStartPos[i] = input.readLong();
				nEndPos[i] = input.readLong();
			}
			input.close();
		} catch (IOException e) {
			e.printStackTrace();
		} catch (Exception e) {
			e.printStackTrace();
		}
	}

	private void processErrorCode(int nErrorCode) {
		System.err.println("Error Code: " + nErrorCode);
	}

	public void siteStop() {
		bStop = true;
		for (int i = 0; i < nStartPos.length; i++)
			fileSplitterFetch[i].splitterStop();
	}
}

FileSplitterFetch.java文件类：

package nc.pub.ump.download;

import java.io.IOException;
import java.io.InputStream;
import java.net.HttpURLConnection;
import java.net.URL;
/**
 * @author whs
 * @version 1.0
 */
public class FileSplitterFetch extends Thread {
	/* 定义文件传输时使用的变量 */
	String sURL;
	/* 分段文件传输开始位置 */
	long nStartPos;
	/* 分段文件传输结束位置 */
	long nEndPos;
	/* 子线程ID */
	int nThreadID;
	/* 完成文件传输 */
	boolean bDownOver = false;
	/* 停止文件传输 */
	boolean bStop = false;
	FileAccess fileAccess = null;
	/**
	 * 
	 * @param sURL
	 * @param sName
	 * @param nStart
	 * @param nEnd
	 * @param id
	 * @throws IOException
	 */
	public FileSplitterFetch(String sURL, String sName, long nStart, long nEnd, int id) throws IOException {
		this.sURL = sURL;
		this.nStartPos = nStart;
		this.nEndPos = nEnd;
		nThreadID = id;
		// 创建文件并打开
		fileAccess = new FileAccess(sName, nStartPos);
	}
	
	public void run() {
		while (nStartPos < nEndPos && !bStop) {
			// 创建连接
			try {
				URL url = new URL(sURL);
				HttpURLConnection httpConnection = (HttpURLConnection) url.openConnection();
				httpConnection.setRequestProperty("User-Agent", "NextFox");
				String sProperty = "bytes= " + nStartPos + "-";
				httpConnection.setRequestProperty("RANGE", sProperty);
				Utility.log(sProperty);
				// 创建输入流对象
				InputStream input = httpConnection.getInputStream();
				byte[] b = new byte[1024];
				int nRead;
				while ((nRead = input.read(b, 0, 1024)) > 0 && nStartPos < nEndPos && !bStop) {
					nStartPos += fileAccess.write(b, 0, nRead);
				}
				Utility.log("Thread " + nThreadID + "is over!");
				bDownOver = true;
			} catch (Exception e) {
				e.printStackTrace();
			}
		}
	}

	/**
	 * 处理和响应服务器头数据。
	 * @param con
	 */
	public void logResponseHead(HttpURLConnection con) {
		for (int i = 1;; i++) {
			String header = con.getHeaderFieldKey(i);
			if (header != null) {
				Utility.log(header + " : " + con.getHeaderField(header));
			} else {
				break;
			}
		}
	}

	public void splitterStop() {
		bStop = true;
	}
}

FileAccess.java负责文件的存储文件类：

package nc.pub.ump.download;

import java.io.IOException;
import java.io.RandomAccessFile;
import java.io.Serializable;

/**
 * 定义文件访问类。
 * @author whs
 */
public class FileAccess implements Serializable {
	private static final long serialVersionUID = 1L;
	RandomAccessFile oSavedFile;
	long nPos;
	public FileAccess() throws IOException {
		this("", 0);
	}

	public FileAccess(String sName, long nPos) throws IOException {
		oSavedFile = new RandomAccessFile(sName, "rw");
		this.nPos = nPos;
		oSavedFile.seek(nPos);
	}

	public synchronized int write(byte[] b, int nStart, int nLen) {
		int n = -1;
		try {
			oSavedFile.write(b, nStart, nLen);
			n = nLen;
		} catch (IOException e) {
			e.printStackTrace();
		}
		return n;
	}
}

SiteInfoBean.java要抓取的文件的信息，如文件保存的目录，名字，抓取文件的URL等文件类：

package nc.pub.ump.download;

/**
 * 定义获取和设置相关文件类信息。
 * @author whs
 * @version 1.0
 */
public class SiteInfoBean {
	/* 定义URL变量 */
	private String sSiteURL;
	/* 定义存文件路径变量 */
	private String sFilePath;
	/* 定义文件名变量 */
	private String sFileName;
	/* 定义传输文件计数器 */
	private int nSplitter;
	public SiteInfoBean() {
		this("", "", "", 5);
	}
	public SiteInfoBean(String sURL, String sPath, String sName, int nSplitter) {
		sSiteURL = sURL;
		sFilePath = sPath;
		sFileName = sName;
		this.nSplitter = nSplitter;
	}
	/**
	 * @return the nSplitter
	 */
	public int getNSplitter() {
		return nSplitter;
	}

	/**
	 * @param splitter
	 *            the nSplitter to set
	 */
	public void setNSplitter(int splitter) {
		nSplitter = splitter;
	}

	/**
	 * @return the sFileName
	 */
	public String getSFileName() {
		return sFileName;
	}

	/**
	 * @param fileName
	 *            the sFileName to set
	 */
	public void setSFileName(String fileName) {
		sFileName = fileName;
	}

	/**
	 * @return the sFilePath
	 */
	public String getSFilePath() {
		return sFilePath;
	}

	/**
	 * @param filePath
	 *            the sFilePath to set
	 */
	public void setSFilePath(String filePath) {
		sFilePath = filePath;
	}

	/**
	 * @return the sSiteURL
	 */
	public String getSSiteURL() {
		return sSiteURL;
	}

	/**
	 * @param siteURL
	 *            the sSiteURL to set
	 */
	public void setSSiteURL(String siteURL) {
		sSiteURL = siteURL;
	}
}

Utility.java工具类，放一些简单的方法文件类：

package nc.pub.ump.download;

/**
 * 定义输出提示信息及线程sleep类。
 * @author whs
 */
public class Utility {

	public static void sleep(int nSecond) {
		try {
			Thread.sleep(nSecond);
		} catch (Exception e) {
			e.printStackTrace();
		}
	}

	public static void log(String sMsg) {
		System.out.println(sMsg);
	}

	public static void log(int sMsg) {
		System.out.println(sMsg);
	}
}

TestMethod.java测试类文件类：

package nc.pub.ump.download;
/**
 * 
 * @author whs
 * @version 1.0
 */
public class TestMethod {
	/* 定义WEB地址和文件名 */
	//private String sWebAddr = "http://nczx.ufida.com.cn/Submodule/ProjectManage/Webapp/Doc/Problem/201410271357166861_1.rar";
	private String sWebAddr = "http://www.12306.cn/mormhweb/ggxxfw/wbyyzj/201106/srca12306.zip";
	//private String sWebAddr = "http://localhost:8080/FileUpload/test.rar";
	/* 定义存文件路径 */
	private String sSavePath = "E:\\whs";
	/* 定义文件名 */
	private String sSaveName = "test.rar";
	public TestMethod() {
		try {
			SiteInfoBean bean = new SiteInfoBean(sWebAddr, sSavePath, sSaveName, 5);
			SiteFileFetch fileFetch = new SiteFileFetch(bean);
			fileFetch.start();
		} catch (Exception e) {
			e.printStackTrace();
		}
	}
	public static void main(String[] args) {
		new TestMethod();
	}
}

posted @ 2015-04-20 13:48 梦之心上阅读(562) 评论(0) 收藏举报

刷新页面返回顶部

梦之心上

年青，千万不要放弃，要有梦想。

断点续传的原理(转)

公告