Google Protocol Buffers versus Java String Handling

Most of the code was copied from elsewhere; I was too lazy to type it out myself.

The source code and the test results follow.

This is the serabuffer.proto file. Use the following command to generate the Java code.

protoc -I=./ --java_out=./ serabuffer.proto

package Feinno.Practice.Learn;

option java_package = "Feinno.Practice.Learn";
option java_outer_classname = "ProtoBufferPractice";

message msgInfo  {
  required int32 ID = 1;
  required int64 GoodID = 2;
  required string Url = 3;
  required string Guid = 4;
  required string Type = 5;
  required int32 Order = 6;
}

The Java code follows. It prints the results directly. To test other string lengths, replace the strings in the code and run it again.

package Feinno.Practice.Learn;

import com.google.protobuf.ByteString;

public class Test
{
    // Build a msgInfo with every required field set and serialize it to bytes.
    public byte[] serialize()
    {
        ProtoBufferPractice.msgInfo.Builder builder = ProtoBufferPractice.msgInfo.newBuilder();
        builder.setGoodID(100);
        builder.setGuid("11111-23222-3333-444");
        builder.setOrder(0);
        // Chinese test payload, kept verbatim: its UTF-8 byte length determines the measured sizes.
        builder.setType("及基于消息的协调机制不适合在某些应用中使用，因此须要有一种可靠的、可扩展的、分布式的、可配置的协调机制来统一系统的状态");
        builder.setID(10);
        builder.setUrl("http://www.gufensoso.com/search/?q=java+protocol+buffer");
        ProtoBufferPractice.msgInfo info = builder.build();
        return info.toByteArray();
    }

    // Parse the bytes back into a msgInfo; the parsed message is discarded.
    public void deserialize(ByteString result)
    {
        try {
            ProtoBufferPractice.msgInfo msg = ProtoBufferPractice.msgInfo.parseFrom(result);
            // System.out.println(msg);
        }
        catch (Exception ex) {
            System.out.println(ex.getMessage());
        }
    }
    
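    public static void main(String[] args)
    {
        Test t = new Test();

        // Time a single protobuf serialization.
        long s1 = System.nanoTime();
        byte[] b = t.serialize();
        long s2 = System.nanoTime();
        System.out.println("Serialized length: " + b.length + ", time (ns): " + (s2 - s1));

        // Time a single protobuf deserialization (includes the ByteString copy).
        long a1 = System.nanoTime();
        t.deserialize(ByteString.copyFrom(b));
        long a2 = System.nanoTime();
        System.out.println("Deserialization time (ns): " + (a2 - a1));

        // Time building the same record as one delimited string.
        // Note: the URL here is shorter than the one passed to setUrl() above.
        long s3 = System.nanoTime();
        StringBuilder sb = new StringBuilder();
        sb.append("Ã").append("100").append("Ã").append("11111-23222-3333-444").append("Ã").append("0").append("Ã").append("及基于消息的协调机制不适合在某些应用中使用，因此须要有一种可靠的、可扩展的、分布式的、可配置的协调机制来统一系统的状态").append("Ã").append("10").append("Ã").append("http://xxx.jpg");
        // Explicit UTF-8 so the byte count does not depend on the platform default charset.
        byte[] c = sb.toString().getBytes(java.nio.charset.StandardCharsets.UTF_8);
        long s4 = System.nanoTime();
        System.out.println("Concatenated length: " + c.length + ", time (ns): " + (s4 - s3));

        // Time splitting the string back into fields; the result is discarded.
        long c1 = System.nanoTime();
        sb.toString().split("Ã");
        long c2 = System.nanoTime();
        System.out.println("Split time (ns): " + (c2 - c1));
    }
}

1. With short strings:
Serialized length: 50, time (ns): 32115919
Concatenated length: 56, time (ns): 36223
Deserialization time (ns): 2484294
Split time (ns): 305783

2. With longer strings, i.e. the ones in the code above:

Serialized length: 265, time (ns): 20092297
Concatenated length: 229, time (ns): 48297
Deserialization time (ns): 2445656
Split time (ns): 354683

3. With still longer strings:
Serialized length: 435, time (ns): 26542406
Concatenated length: 398, time (ns): 53127
Deserialization time (ns): 5412019
Split time (ns): 527950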

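Results:
1. Speed: in these runs string concatenation was roughly 400 to 900 times faster than serialization, and splitting the string was about 7 to 10 times faster than deserialization.
2. Size: with small data the serialized form is smaller (50 vs 56 bytes); once the data grows, string concatenation comes out ahead (229 vs 265 bytes, then 398 vs 435 bytes). Note that the concatenation test uses a shorter URL (http://xxx.jpg, 14 bytes) than the protobuf test (55 bytes), so the two sizes are not strictly like-for-like.
3. The numbers above are not fixed; they depend on the specific data. Each figure is also a single call timed with System.nanoTime(), with no warm-up.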
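
To see where the byte counts above come from, here is a minimal sketch that encodes a msgInfo by hand, following the protobuf wire format (varint tags and values, length-prefixed UTF-8 strings), and builds the delimited string the same way the test does. The class WireSizeSketch and its helper names are made up for illustration. TYPE is a stand-in, not the original text: 59 repeated CJK characters with the same UTF-8 length (177 bytes) as the Type string in the test. It needs Java 11 or later for String.repeat.

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;

class WireSizeSketch {
    static final String URL = "http://www.gufensoso.com/search/?q=java+protocol+buffer";
    static final String GUID = "11111-23222-3333-444";
    // Stand-in (assumption): 59 repeated CJK characters, 177 UTF-8 bytes like the Type string in the test.
    static final String TYPE = "\u6001".repeat(59);

    // Append value as a base-128 varint: 7 payload bits per byte, lowest group first.
    static void writeVarint(ByteArrayOutputStream out, long value) {
        while ((value & ~0x7FL) != 0) {
            out.write((int) ((value & 0x7F) | 0x80));
            value >>>= 7;
        }
        out.write((int) value);
    }

    // Integer field: tag = (fieldNumber << 3) | wire type 0, then the value as a varint.
    static void writeVarintField(ByteArrayOutputStream out, int fieldNumber, long value) {
        writeVarint(out, (long) fieldNumber << 3);
        writeVarint(out, value);
    }

    // String field: tag with wire type 2, then the byte length, then the UTF-8 bytes.
    static void writeStringField(ByteArrayOutputStream out, int fieldNumber, String s) {
        byte[] utf8 = s.getBytes(StandardCharsets.UTF_8);
        writeVarint(out, ((long) fieldNumber << 3) | 2);
        writeVarint(out, utf8.length);
        out.write(utf8, 0, utf8.length);
    }

    // Hand-encode the six msgInfo fields in field-number order.
    static byte[] encodeMsgInfo(int id, long goodId, String url, String guid, String type, int order) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        writeVarintField(out, 1, id);
        writeVarintField(out, 2, goodId);
        writeStringField(out, 3, url);
        writeStringField(out, 4, guid);
        writeStringField(out, 5, type);
        writeVarintField(out, 6, order);
        return out.toByteArray();
    }

    // Prefix every field with the delimiter, as the test's StringBuilder chain does.
    static byte[] joinFields(String delimiter, String... fields) {
        StringBuilder sb = new StringBuilder();
        for (String f : fields) {
            sb.append(delimiter).append(f);
        }
        return sb.toString().getBytes(StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        byte[] proto = encodeMsgInfo(10, 100L, URL, GUID, TYPE, 0);
        // "\u00C3" is the delimiter character used in the test.
        byte[] joined = joinFields("\u00C3", "100", GUID, "0", TYPE, "10", "http://xxx.jpg");
        byte[] sameUrl = joinFields("\u00C3", "100", GUID, "0", TYPE, "10", URL);
        System.out.println("hand-encoded protobuf bytes: " + proto.length);
        System.out.println("delimited bytes (short URL): " + joined.length);
        System.out.println("delimited bytes (same URL):  " + sameUrl.length);
    }
}
```

With these inputs the sketch reports 265 bytes for the hand-encoded message and 229 bytes for the delimited string, matching case 2 above. Giving both sides the same long URL brings the delimited string to 270 bytes.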

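Conclusion:

Material online says that Protocol Buffers is excellent at serialization in terms of both size and performance, yet the test results here show a very obvious gap compared with plain string handling.

Don't take it on faith; you only get a real feel for it after testing it yourself. In some scenarios string handling has the advantage over serialization. If a message middleware handled its messages as strings, performance should improve severalfold.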
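
Since the timings in this post come from single calls with no warm-up, one way to test it yourself more carefully is to repeat each operation many times after a warm-up phase and report the mean. The following is a rough sketch of that idea for the string side only (protobuf is left out so that it runs without generated classes). WarmupTimingSketch, join and averageNanos are hypothetical names, the fields are a shortened ASCII-only sample, and the iteration counts are arbitrary. A dedicated harness such as JMH would be more rigorous.

```java
import java.nio.charset.StandardCharsets;
import java.util.function.Supplier;

class WarmupTimingSketch {
    // The delimiter character used in the test (U+00C3), written as an escape.
    static final String DELIM = "\u00C3";

    // Prefix every field with the delimiter, mirroring the StringBuilder chain in the test.
    static String join(String... fields) {
        StringBuilder sb = new StringBuilder();
        for (String f : fields) {
            sb.append(DELIM).append(f);
        }
        return sb.toString();
    }

    // Run the task `warmup` times untimed, then return the mean time per call in nanoseconds.
    static long averageNanos(Supplier<Object> task, int warmup, int measured) {
        if (measured <= 0) {
            throw new IllegalArgumentException("measured must be positive");
        }
        Object sink = null; // keep the last result so the work is not trivially unused
        for (int i = 0; i < warmup; i++) {
            sink = task.get();
        }
        long start = System.nanoTime();
        for (int i = 0; i < measured; i++) {
            sink = task.get();
        }
        long elapsed = System.nanoTime() - start;
        if (sink == null) {
            throw new IllegalStateException("task returned null");
        }
        return elapsed / measured;
    }

    public static void main(String[] args) {
        // Shortened sample fields (assumption): the long Type string is omitted here.
        String[] fields = {"100", "11111-23222-3333-444", "0", "10", "http://xxx.jpg"};
        String joined = join(fields);
        long joinNs = averageNanos(() -> join(fields).getBytes(StandardCharsets.UTF_8), 10_000, 100_000);
        long splitNs = averageNanos(() -> joined.split(DELIM), 10_000, 100_000);
        System.out.println("join + encode, mean ns per call: " + joinNs);
        System.out.println("split, mean ns per call: " + splitNs);
    }
}
```

The same averageNanos helper could wrap the serialize() and deserialize() calls from the test, so that both approaches are measured under the same conditions.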

posted @ 2015-10-27 16:19 mfrbuaa
