[No0000145]深入浅出图解C#堆与栈 C# Heap(ing) VS Stack(ing)理解堆与栈2/4

前言

 
虽然在.Net Framework 中我们不必考虑内在管理和垃圾回收(GC),但是为了优化应用程序性能我们始终需要了解内存管理和垃圾回收(GC)。另外,了解内存管理可以帮助我们理解在每一个程序中定义的每一个变量是怎样工作的。
 

简介

 
这篇文章我们将介绍一些方法参数传递行为在堆与栈中的影响。前几节我们介绍了堆与栈的基本工作原理,程序执行时值类型与引用类型在堆栈中的存储。另外,我们已经介绍了一些关于指针的基本知识。这一节中参数传递对堆栈的影响很重要,下面会慢慢道来。
 

参数,大画面

下面是当代码运行时会产生的一个详细过程。上几节已经介绍过当一个方法被调用时会产生的基本情况,让我们来看一下更加详细的内容。
当我们调用一个方法时会发生以下情形:
  1. 栈会分配一块内存空间给程序执行所需要的信息(我们叫它栈结构Stack Frame)。一个栈结构包含方法调用地址(指针),它以一个GOTO指令的形式存在栈里。因此,当程序执行完方法(method)时,它会知道怎么样返回并继续执行代码。
  2. 方法的所有参数将被复制到栈里,这是我们将要更加详细介绍的部分。
  3. 控制被传递到JIT编译过的方法里,同时线程开始执行代码。此时,我们将有另一个方法呈现在栈结构的“回调栈”里。
代码:
[csharp] view plain copy
 
  1. public int AddFive(int pValue)  
  2.           {  
  3.                 int result;  
  4.                 result = pValue + 5;  
  5.                 return result;  
  6.           }  

栈像下图所示:
 
注意:ReturnValue方法不会存在栈上,图中把ReturnValue作为此栈结构的开始只是为了解释栈原理。
 
像前几节介绍的,值类型和引用类型在栈里的存储是不同的。栈为任何值类型创建副本,栈也为任何引用类型的指针创建副本。
 

值类型传递

下面是值类型传递在栈里的内幕。
 
首先,当我们传递一个值类型变量时,栈会为它分配一块内存空间并把值类型变量的值存储进去。看下面的代码:
[csharp] view plain copy
 
  1. class Class1  
  2.      {  
  3.           public void Go()  
  4.           {  
  5.               int x = 5;  
  6.               AddFive(x);  
  7.    
  8.               Console.WriteLine(x.ToString());  
  9.                 
  10.           }  
  11.    
  12.           public int AddFive(int pValue)  
  13.           {  
  14.               pValue += 5;  
  15.               return pValue;  
  16.           }  
  17.      }  

当代码执行时,栈为x分配一块内存空间并存储值5
然后,AddFive()被放到栈上,同时栈分配内存空间给参数pValue并复制x的值给它。
当AddFive()执行完成,线程被传递回Go()。同时因为AddFive()执行完,它的参数pValue也实质上被移除。
所以结果是5是合理的。关键点是任何被传递的值类型参数仅是一个碳复制,因为我们希望保护原始变量的值。
有一点要记住的是,如果我们有一个非常庞大的值类型(如,庞大的struct类型)传递到栈里,当处理器循环复制它并循环占有栈空间时将会非常耗资源。栈没有无限的空间去使用,就像用水杯不断的接水早晚会溢出一样。Struct类型可以变得非常庞大,我们要小心并清醒的使用它。
 
下面是一个比较大的struct结构类型:
[csharp] view plain copy
 
  1. public struct MyStruct  
  2.           {  
  3.               long a, b, c, d, e, f, g, h, i, j, k, l, m;  
  4.           }  

让我们看看执行下面代码Go()方法时再到DoSomething()方法会发生的情况:
[csharp] view plain copy
 
  1. public void Go()  
  2.          {  
  3.             MyStruct x = new MyStruct();  
  4.             DoSomething(x);  
  5.                
  6.          }  
  7.   
  8.   
  9.           public void DoSomething(MyStruct pValue)  
  10.           {  
  11.                    // 省略具体实现....  
  12.           }  

这可能会非常低效。想像一下如果我们传递MyStruct几千次,它会怎么样让程序死掉。
 
那么,我们怎么才能回避这样的问题呢?那就是仅传递原始值类型的引用。
public void Go()
          {
             MyStruct x = new MyStruct();
             DoSomething(ref x);
              
          }
 
           public struct MyStruct
           {
               long a, b, c, d, e, f, g, h, i, j, k, l, m;
           }
 
           public void DoSomething(ref MyStruct pValue)
           {
                    // 省略实现....
           }
 
 
这样就能节省内存并提升内存使用效率
 
唯一需要注意的是传递引用时我们在访问原始变量x的值,任可对pValue的改变都会影响到x。
下面的代码会将x改变成"12345",因为pValue.a实际上指向原始x声明时所在的内存地址。
[csharp] view plain copy
 
  1. public void Go()  
  2.           {  
  3.              MyStruct x = new MyStruct();  
  4.              x.a = 5;  
  5.              DoSomething(ref x);  
  6.    
  7.              Console.WriteLine(x.a.ToString());  
  8.                  
  9.           }  
  10.    
  11.           public void DoSomething(ref MyStruct pValue)  
  12.           {  
  13.                    pValue.a = 12345;  
  14.           }  
 

前言

 
虽然在.Net Framework 中我们不必考虑内在管理和垃圾回收(GC),但是为了优化应用程序性能我们始终需要了解内存管理和垃圾回收(GC)。另外,了解内存管理可以帮助我们理解在每一个程序中定义的每一个变量是怎样工作的。
 

简介

 
继续上篇未完成的“参数传递对堆栈的影响”。
 

引用类型传递

 
传递引用类型跟上一节所示例中用引用的方式传递值类型相似。
 
如果使用引用类型(原文可能笔误,写的是值类型):
[csharp] view plain copy
 
  1. public class MyInt  
  2.            {  
  3.                public int MyValue;  
  4.            }  

然后调用Go()方法,MyInt会被放到堆里因为它是一个引用类型。
[csharp] view plain copy
 
  1. public void Go()  
  2.           {  
  3.              MyInt x = new MyInt();                
  4.           }  

 
如果执行下面代码中的Go():
[csharp] view plain copy
 
  1. public void Go()  
  2.           {  
  3.              MyInt x = new MyInt();  
  4.              x.MyValue = 2;  
  5.    
  6.              DoSomething(x);  
  7.    
  8.              Console.WriteLine(x.MyValue.ToString());  
  9.                 
  10.           }  
  11.    
  12.            public void DoSomething(MyInt pValue)  
  13.            {  
  14.                pValue.MyValue = 12345;  
  15.            }  

会发生这种情况:
 
  1. 开始调用Go(),栈分配一块内存空间给x。
  2. 执行行到DoSomething(),栈分配一块内在空间给pValue。
  3. x的值是堆中MyInt对应在栈里的内存地址,复制x给pValue。
因此,我们用pValue改变MyInt的MyValue的值时,x最终也会获得这个改变的值"12345“。
如果我们用引用的方式传递一个引用类型变量呢?
 

用引用的方式传递引用类型

我们有一个类Thing, 类Animal和Vegetables衍生于Thing:
[csharp] view plain copy
 
  1. public class Thing  
  2.            {  
  3.            }  
  4.    
  5.            public class Animal:Thing  
  6.            {  
  7.                public int Weight;  
  8.            }  
  9.    
  10.            public class Vegetable:Thing  
  11.            {  
  12.                public int Length;  
  13.            }  

执行下面的Go()方法:
[csharp] view plain copy
 
  1. public void Go()  
  2.           {  
  3.              Thing x = new Animal();  
  4.              
  5.              Switcharoo(ref x);  
  6.    
  7.               Console.WriteLine(  
  8.                 "x is Animal    :   "  
  9.                 + (x is Animal).ToString());  
  10.    
  11.               Console.WriteLine(  
  12.                   "x is Vegetable :   "  
  13.                   + (x is Vegetable).ToString());  
  14.                 
  15.           }  
  16.    
  17.            public void Switcharoo(ref Thing pValue)  
  18.            {  
  19.                pValue = new Vegetable();  
  20.            }  

x最终变成Vegetable。
打印结果:
[plain] view plain copy
 
  1. x is Animal    :   False  
  2. x is Vegetable :   True  

让我们看看堆栈里到底发生了什么情况
 
  1. 调用Go()方法,栈分配一块内存空间给x。
  2. 堆分配一块内存空间给Animal。
  3. 开始执行Switcharoo()方法,栈分配一块内存空间给pValue并指向x。
  4. 栈分配一块内存空间给Vegetable。
  5. pValue改变了x的值使其指向Vegetable的内在地址。
如果我们不是用ref传递的,打印结果正相反。
 
 

总结

 
我们已经演示了参数传递是怎么在内在中处理的。在接下来的文章里,存储在栈中的引用变量会产生什么情况以及怎么解决对象复制带来的问题。

 

Even though with the .NET framework we don't have to actively worry about memory management and garbage collection (GC), we still have to keep memory management and GC in mind in order to optimize the performance of our applications. Also, having a basic understanding of how memory management works will help explain the behavior of the variables we work with in every program we write. In this article I'll cover some of the behaviors we need to be aware of when passing parameters to methods.

In Part I we covered the basics of the Heap and Stack functionality and where Variable Types and Reference Types are allocated as our program executes. We also covered the basic idea of what a Pointer is.

Parameters, the Big Picture.

Here's the detailed view of what happens as our code executes. We covered the basics of what happens when we make a method call in Part I. Let's get into more detail...

When we make a method call here's what happens:

  1. Space is allocated for information needed for the execution of our method on the stack (called a Stack Frame). This includes the calling address (a pointer) which is basically a GOTO instruction so when the thread finishes running our method it knows where to go back to in order to continue execution.  
  2. Our method parameters are copied over. This is what we want to look at more closely.
  3. Control is passed to the JIT'ted method and the thread starts executing code. Hence, we have another method represented by a stack frame on the "call stack".

The code:

          public int AddFive(int pValue)
          {
                int result;
                result = pValue + 5;
                return result;
          }

Will make the stack look like this:

 

NOTE : the method does not live on the stack, and is illustrated here just for reference as the beginnnig of the stack frame.
 
As discussed in Part I, Parameter placement on the stack will be handled differently depending on whether it is a value type or a reference type. A value types is copied over and the reference of a reference type is copied over.ed over.

Passing Value Types.

Here's the catch with value types...

First, when we are passing a value types, space is allocated and the value in our type is copied to the new space on the stack. Look at the following method:

     class Class1

     {

          public void Go()

          {

              int x = 5;

              AddFive(x);

 

              Console.WriteLine(x.ToString());

              

          }

 

          public int AddFive(int pValue)

          {

              pValue += 5;

              return pValue;

          }

     }

As the method executes, space for "x" is placed on the stack with a value of 5.


 
Next, AddFive() is placed on the stack with space for it's parameters and the value is copied, bit by bit from x.


 
When AddFive() has finished execution, the thread is passed back to Go() and because AddFive() has completed, pValue is essentially "removed":


 
So it makes sense that the output from our code is "5", right? The point is that any value type parameters passed into a method are carbon copies and we count on the original variable's value to be preserved.

One thing to keep in mind is that if we have a very large value type (such as a big struct) and pass it to the stack, it can get very expensive in terms of space and processor cycles to copy it over each time. The stack does not have infinite space and just like filling a glass of water from the tap, it can overflow. A struct is a value type that can get pretty big and we have to be aware of how we are handling it.

Here's a pretty big struct:

           public struct MyStruct

           {

               long a, b, c, d, e, f, g, h, i, j, k, l, m;

           }

Take a look at what happens when we execute Go() and get to the DoSomething() method below:

          public void Go()

          {

             MyStruct x = new MyStruct();

             DoSomething(x);

              

          }

 

 

           public void DoSomething(MyStruct pValue)

           {

                    // DO SOMETHING HERE....

           }

This can be really inefficient. Imaging if we passed the MyStruct a couple thousand times and you can understand how it could really bog things down.

So how do we get around this problem? By passing a reference to the original value type as follows: 

          public void Go()

          {

             MyStruct x = new MyStruct();

             DoSomething(ref x);

              

          }

 

           public struct MyStruct

           {

               long a, b, c, d, e, f, g, h, i, j, k, l, m;

           }

 

           public void DoSomething(ref MyStruct pValue)

           {

                    // DO SOMETHING HERE....

           }

This way we end up with more memory efficient allocation of our objects in memory. 


 
The only thing we have to watch out for when passing our value type by reference is that we have access to the value type's value. Whatever is changed in pValue is changed in x. Using the code below, our results are going to be "12345" because the pValue.a actually is looking at the memory space where our original x variable was declared.

          public void Go()

          {

             MyStruct x = new MyStruct();

             x.a = 5;

             DoSomething(ref x);

 

             Console.WriteLine(x.a.ToString());

               

          }

 

          public void DoSomething(ref MyStruct pValue)

          {

                   pValue.a = 12345;

          }

Passing Reference Types.

Passing parameters that are reference types is similar to passing value types by reference as in the previous example.

If we are using the value type

           public class MyInt

           {

               public int MyValue;

           }

And call the Go() method, the MyInt ends up on the heap because it is a reference type:

          public void Go()

          {

             MyInt x = new MyInt();              

          }

 

If we execute Go() as in the following code ...

          public void Go()

          {

             MyInt x = new MyInt();

             x.MyValue = 2;

 

             DoSomething(x);

 

             Console.WriteLine(x.MyValue.ToString());

              

          }

 

           public void DoSomething(MyInt pValue)

           {

               pValue.MyValue = 12345;

           }

Here's what happens...

 

  1.  Starting with the call to Go() the variable x goes on the stack.
  2. Starting with the call to DoSomething() the parameter pValue goes on the stack.
  3. The value of x (the address of MyInt on the stack) is copied to pValue

So it makes sense that when we change the MyValue property of the MyInt object in the heap using pValue and we later refer to the object on the heap using x, we get the value "12345".

So here's where it gets interesting. What happens when we pass a reference type by reference?

Check it out. If we have a Thing class and Animal and Vegetables are both things:

           public class Thing

           {

           }

 

           public class Animal:Thing

           {

               public int Weight;

           }

 

           public class Vegetable:Thing

           {

               public int Length;

           }

And we execute the Go() method below:

          public void Go()

          {

             Thing x = new Animal();

           

             Switcharoo(ref x);

 

              Console.WriteLine(

                "x is Animal    :   "

                + (x is Animal).ToString());

 

              Console.WriteLine(

                  "x is Vegetable :   "

                  + (x is Vegetable).ToString());

              

          }

 

           public void Switcharoo(ref Thing pValue)

           {

               pValue = new Vegetable();

           }

Our variable x is turned into a Vegetable.

x is Animal    :   False
x is Vegetable :   True

Let's take a look at what's happening:

 

  1. Starting with the Go() method call, the x pointer goes on the stack
  2. The Animal goes on the hea
  3. Starting with the call to Switcharoo() method, the pValue goes on the stack and points to x

  4. The Vegetable goes on the heapthe heap
  5. The value of x is changed through pValue to the address of the Vegetable

If we don't pass the Thing by ref, we'll keep the Animal and get the opposite results from our code.

If the above code doesn't make sense, check out my article on types of Reference variables to get a better understanding of how variables work with reference types.

In Conclusion.

We've looked at how parameter passing is handled in memory and now know what to look out for. In the next part of this series, we'll take a look at what happens to reference variables that live in the stack and how to overcome some of the issues we'll have when copying objects.

For now.

posted @ 2018-05-29 10:54  CharyGao  阅读(327)  评论(0编辑  收藏  举报