Base 64编码实现

Base64是MIME邮件中常用的编码方式之一。它的主要思想是将输入的字符串或数据编码成只含有{'A'-'Z', 'a'-'z', '0'-'9', '+', '/'}这64个可打印字符的串，故称为“Base64”。
Base64要求把每三个8Bit的字节转换为四个6Bit的字节（3*8 = 4*6 = 24），然后把6Bit再添两位高位0，组成四个8Bit的字节，也就是说，转换后的字符串理论上将要比原来的长1/3。举个例子：

转换前	aaaaaabb	ccccdddd	eeffffff
转换后	00aaaaaa	00bbcccc	00ddddee	00ffffff

上面的三个字节是原文，下面的四个字节是转换后的Base64编码，其前两位均为0。
转换后，我们用一个码表来得到我们想要的字符串（也就是最终的Base64编码），这个表是这样的：（摘自RFC2045）
                            Table 1: The Base64 Alphabet
      Value Encoding  Value Encoding  Value Encoding  Value Encoding
           0 A            　　17 R            　　34 i            51 z
           1 B            　　18 S            　　35 j            52 0
           2 C            　　19 T            　　36 k            53 1
           3 D            　　20 U            　　37 l            54 2
           4 E            　　21 V             　　38 m            55 3
           5 F            　　22 W            　　39 n            56 4
           6 G            　　23 X             　　40 o            57 5
           7 H            　　24 Y             　　41 p            58 6
           8 I            　　 25 Z            　　42 q            59 7
           9 J            　　26 a              　　43 r            60 8
          10 K            　　27 b            　　44 s            61 9
          11 L            　　28 c            　　45 t            62 +
          12 M            　　29 d            　　46 u            63 /
          13 N            　　30 e            　　47 v
          14 O            　　31 f            　　48 w         (pad) =
          15 P            　　32 g            　　49 x
          16 Q            　　33 h            　　50 y

再看一个例子：

转换前	10101101	10111010	01110110
转换后	00101011	00011011	00101001	00110110
十进制	43	27	41	54
对应码表中的值	r	b	p	2

上面的24位编码，编码后的Base64值为 rbp2
解码同理，把 rbq2 的二进制位连接上再重组得到三个8位值，得出原码。(解码只是编码的逆过程)

编码的过程总结如下：
1. 第一个字符通过右移2位获得第一个目标字符的Base64表位置，根据这个数值取到表上相应的字符，就是第一个目标字符。
2. 然后将第一个字符左移4位加上第二个字符右移4位，即获得第二个目标字符。
3. 再将第二个字符左移2位加上第三个字符右移6位，获得第三个目标字符。
4. 最后取第三个字符的右6位即获得第四个目标字符。
5. 在以上的每一个步骤之后，再把结果与 0x3F 进行 AND 位操作，就可以得到编码后的字符了。

原文的字节不够的地方可以用全0来补足，转换时Base64编码用=号来代替。这就是为什么有些Base64编码会以一个或两个等号结束的原因，但等号最多只有两个。因为：
余数 = 原文字节数 MOD 3
所以余数任何情况下都只可能是0，1，2这三个数中的一个。如果余数是0的话，就表示原文字节数正好是3的倍数（理想情况）。如果是1的话，为了让Base64编码是4的倍数，就要补2个等号；同理，如果是2的话，就要补1个等号。

//======================================
//Header File
#ifndef _BASE64HELPER_H_
#define _BASE64HELPER_H_
//
#include <string>
//
namespace sfsMUtils {

class Base64Helper
{
public:

    // convert from Base64 to ANSI
    static std::string encode(const std::string in_str);

    // convert from ANSI to Base64
    static std::string decode(const std::string in_str);

private:

    // encode table
    const static std::string _base64_encode_chars;
 
    // decode table
    const static char _base64_decode_chars[128];
};

} // namespace sfsMUtils

#endif  // _BASE64HELPER_H_

//=======================================
//Source File
#include "sfsBase64Helper.h"
//
namespace sfsMUtils 
{

const std::string Base64Helper::_base64_encode_chars = 
    "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789+/";

const char Base64Helper::_base64_decode_chars[] = 
{
    -1, -1, -1, -1, -1, -1, -1, -1, -1, -1, -1, -1, -1, -1, -1, -1,
    -1, -1, -1, -1, -1, -1, -1, -1, -1, -1, -1, -1, -1, -1, -1, -1,
    -1, -1, -1, -1, -1, -1, -1, -1, -1, -1, -1, 62, -1, -1, -1, 63,
    52, 53, 54, 55, 56, 57, 58, 59, 60, 61, -1, -1, -1, -1, -1, -1,
    -1,  0,  1,  2,  3,  4,  5,  6,  7,  8,  9, 10, 11, 12, 13, 14,
    15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, -1, -1, -1, -1, -1,
    -1, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40,
    41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, -1, -1, -1, -1, -1
};

std::string Base64Helper::encode(const std::string in_str)
{
    std::string out_str;
    unsigned char c1, c2, c3;
    int i = 0;
    int len = in_str.length();

    while ( i<len )
    {
        // read the first byte
        c1 = in_str[i++];
        if ( i==len )       // pad with "="
        {
            out_str += _base64_encode_chars[ c1>>2 ];
            out_str += _base64_encode_chars[ (c1&0x3)<<4 ];
            out_str += "==";
            break;
        }

        // read the second byte
        c2 = in_str[i++];
        if ( i==len )       // pad with "="
        {
            out_str += _base64_encode_chars[ c1>>2 ];
            out_str += _base64_encode_chars[ ((c1&0x3)<<4) | ((c2&0xF0)>>4) ];
            out_str += _base64_encode_chars[ (c2&0xF)<<2 ];
            out_str += "=";
            break;
        }

        // read the third byte
        c3 = in_str[i++];
        // convert into four bytes string
        out_str += _base64_encode_chars[ c1>>2 ];
        out_str += _base64_encode_chars[ ((c1&0x3)<<4) | ((c2&0xF0)>>4) ];
        out_str += _base64_encode_chars[ ((c2&0xF)<<2) | ((c3&0xC0)>>6) ];
        out_str += _base64_encode_chars[ c3&0x3F ];
    }

    return out_str;
}

std::string Base64Helper::decode(const std::string in_str)
{
    std::string out_str;
    char c1, c2, c3, c4;
    int i = 0;
    int len = in_str.length();

    while ( i<len)
    {
        // read the first byte
        do {
            c1 = _base64_decode_chars[ in_str[i++] ];
        } while ( i<len && c1==-1);

        if ( c1==-1)
            break;

        // read the second byte
        do {
            c2 = _base64_decode_chars[ in_str[i++] ];
        } while ( i<len && c2==-1);

        if ( c2==-1 )
            break;

        // assamble the first byte
        out_str += char( (c1<<2) | ((c2&0x30)>>4) );

        // read the third byte
        do {
            c3 = in_str[i++];
            if ( c3==61 )       // meet with "=", break
                return out_str;
            c3 = _base64_decode_chars[ c3 ];
        } while ( i<len && c3==-1);

        if ( c3==-1 )
            break;

        // assamble the second byte
        out_str += char( ((c2&0XF)<<4) | ((c3&0x3C)>>2) );

        // read the fourth byte
        do {
            c4 = in_str[i++];
            if ( c4==61 )       // meet with "=", break
                return out_str;
            c4 = _base64_decode_chars[ c4 ];
        } while ( i<len && c4==-1 );

        if ( c4==-1 )
            break;

        // assamble the third byte
        out_str += char( ((c3&0x03)<<6) | c4 );
    }

    return out_str;
}

} // namespace sfsMUtils

posted @ 2012-07-17 11:09 友学友阅读(378) 评论(0) 编辑收藏举报

刷新页面返回顶部

生命不在于得到过什么，而在于曾经经历过什么

让一切随风飘动

Base 64编码实现

公告