当我们用C++开发一些C++控制台小工具时,会需要一些用户输入的参数来决定程序如何工作和执行,而用户输入参数的方式大部分都是采用命令行参数的方式。
比如上一篇文章 玩转Windows服务系列——命令行管理Windows服务 中介绍的sc和net工具。
既然命令行参数这么普遍也这么有用,那么就有必要学习一下如何解析命令行参数。
如何解析命令行参数
那么命令行参数要如何解析呢。
最最最笨的办法就是每次程序中需要解析命令行参数时,就写一堆解析的代码,但是这样的效率其实很低的,不如使用开源库中的帮助类来解析。
我所了解的开源库的帮助类有
- ACE库的ACE_Get_Opt
- boost的Program_options
ACE_Get_Opt类的主要使用方法介绍
那么我们主要看一下ACE_Get_Opt类
先看它的构造方法
ACE_Get_Opt (int argc, ACE_TCHAR **argv, const ACE_TCHAR *optstring = ACE_TEXT (""), int skip_args = 1, int report_errors = 0, int ordering = PERMUTE_ARGS, int long_only = 0);
下面是代码中的注释
/** * Constructor initializes the command line to be parsed. All information * for parsing must be supplied to this constructor. * * @param argc The number of @a argv elements to parse. * @param argv Command line tokens, such as would be passed * to @c main(). * @param optstring Nul-terminated string containing the legitimate * short option characters. A single colon ":" * following an option character means the option * requires an argument. A double colon "::" following * an option character means the argument is optional. * The argument is taken from the rest of the current * @a argv element, or from the following @a argv * element (only valid for required arguments; * optional arguments must always reside in the same * @a argv element). The argument value, if any is * returned by the @c opt_arg() method. * @a optstring can be extended by adding long options * with corresponding short options via the * @c long_option() method. If the short option * already appears in @a optstring, the argument * characteristics must match, otherwise it is added. * See @c long_option() for more information. * If 'W', followed by a semi-colon ';' appears in * @a optstring, then any time a 'W' appears on the * command line, the following argument is treated as * a long option. For example, if the command line * contains "program -W foo", "foo" is treated as a * long option, that is, as if "program --foo" had * been passed. * The following characters can appear in @a optstring * before any option characters, with the described * effect: * - '+' changes the @a ordering to @a REQUIRE_ORDER. * - '-' changes the @a ordering to @a RETURN_IN_ORDER. * - ':' changes the return value from @c operator() * and get_opt() from '?' to ':' when an option * requires an argument but none is specified. * * @param skip_args Optional (default 1). The specified number of * initial elements in @a argv are skipped before * parsing begins. Thus, the default prevents * @a argv[0] (usually the command name) from being * parsed. @a argc includes all @a argv elements, * including any skipped elements. * @param report_errors Optional, if non-zero then parsing errors cause * an error message to be displayed from the * @c operator() method before it returns. The * error message is suppressed if this argument is 0. * This setting also controls whether or not an error * message is displayed in @c long_option() encounters * an error. * @param ordering Optional (default is @c PERMUTE_ARGS); determines * how the @a argv elements are processed. This argument * is overridden by two factors: * -# The @c POSIXLY_CORRECT environment variable. If * this environment variable is set, the ordering * is changed to @c REQUIRE_ORDER. * -# Leading characters in @a optstring (see above). * Any leading ordering characters override both * the @a ordering argument and any effect of the * @c POSIXLY_CORRECT environment variable. * @param long_only Optional. If non-zero, then long options can be * specified using a single '-' on the command line. * If the token is not a long option, it is processed * as usual, that is, as a short option or set of * short options. * * Multiple short options can be combined as long as only the last * one can takes an argument. For example, if @a optstring is defined as * @c "abc:" or @c "abc::" then the command line @e "program -abcxxx" short * options @e a, @e b, and @e c are found with @e "xxx" as the argument for * @e c. * However, if the command line is specified as @e "program -acb" only * options @e a and @e c are found with @e "b" as the argument for @e c. * Also, for options with optional arguments, that is, those followed by * "::", the argument must be in the same @a argv element, so "program -abc * xxx" will only find "xxx" as the argument for @e c if @a optstring is * specified as @c "abc:" not @c "abc::". */
构造函数共有七个参数,
argc、argv为程序入口函数main方法的参数,也就是程序的命令行。
optstring 为指定的参数选项,并且是一个字符作为一个选项,主要包含三种形式
- 单纯的一个字符选项,比如 s,表示此选项后面不能添加选项的参数
- 一个字符选项后跟一个冒号,比如 s:,表示此选项后面会有一个参数
- 一个字符后面跟两个冒号,比如 s::,表示此选项后面即可以有参数也可以无参数
skip_args 表示从argv的第几个元素开始,默认为1,一般情况下argv[0]为程序的path
report_errors遇到不识别的参数时是否提示错误
long_only表示是否只包含字符串的选项参数。
下面解释一下 字符选项和字符串选项,也就是 short option 和 long option.
- short option 以 字符’-’开始,比如 -s
- long option 以两个字符’-’开始,日入 --server
这是在默认的情况下,也就是long_only = 0的情况下。而当long_only不等于0的时候,就可以通过-server来表示long option了。
那么来看一个定义的例子
ACE_Get_Opt opt(argc, argv, "s:u:p:", 1, 0);
三个重载方法long_option的介绍
由于构造方法中的optstring只能制定short option,所以必须通过long_option方法来指定long option。
ACE_Get_Opt类中有三个重载的long_option方法,下面来一一介绍
long_option方法一:
/// Adds a long option with no corresponding short option. /** * If the @a name option is seen, @c operator() returns 0. * * @param name The long option to add. * @param has_arg Defines the argument requirements for * the new option. * * @retval 0 Success * @retval -1 The long option can not be added. */ int long_option (const ACE_TCHAR *name, OPTION_ARG_MODE has_arg = NO_ARG);
这个方法可以增加long_option,第一个参数name表示要添加的option的名字,第二个参数表示这个option是否包含参数,下面是OPTION_ARG_MODE的定义:
/// Mutually exclusive option argument mode used by long options. enum OPTION_ARG_MODE { /// Doesn't take an argument. NO_ARG = 0, /// Requires an argument, same as passing ":" after a short option /// character in @a optstring. ARG_REQUIRED = 1, /// Argument is optional, same as passing "::" after a short /// option character in @a optstring. ARG_OPTIONAL = 2 };
- NO_ARG表示没有参数,对应short option中的没有冒号的情况
- ARG_REQUIRED表示必须有参数,对应short option中有一个冒号的情况
- ARG_OPTIONAL表示参数可有可无,对应short option中有两个冒号的情况
此方法可以这样使用
if(opt.long_option("server", ACE_Get_Opt::ARG_REQUIRED) == -1) { cout << "server option can not be added" << endl; }
long_option方法二:
/// Adds a long option with a corresponding short option. /** * @param name The long option to add. * @param short_option A character, the short option that corresponds * to @a name. * @param has_arg Defines the argument requirements for * the new option. If the short option has already * been supplied in the @a optstring, @a has_arg * must match or an error is returned; otherwise, the * new short option is added to the @a optstring. * * @retval 0 Success * @retval -1 The long option can not be added. */ int long_option (const ACE_TCHAR *name, int short_option, OPTION_ARG_MODE has_arg = NO_ARG);
这个方法中增加了一个short_option参数,表示这个long_option与对应的short_option所表达的意义是一致的,当解析命令行时,遇到这个long_option时,就按此处的short option进行处理。
此方法可以这样使用
if(opt.long_option("server", 's', ACE_Get_Opt::ARG_REQUIRED) == -1) { cout << "server option can not be added" << endl; }
long_option方法三:
/// Returns the name of the long option found on the last call to /// @c operator() or 0 if none was found. const ACE_TCHAR *long_option (void) const;
这个方法表示解析命令行时,如果遇到long_option,则可以通过这个方法来获取是哪一个long_option。
命令行的解析
命令行的解析就要依赖 operator()()方法,和opt_arg方法,前者解析遇到的是哪个short option 或者 long option,而 opt_arg方法则获取option的参数。
下面看一下方法的声明
/** * Scan elements of @a argv (whose length is @a argc) for short option * characters given in @a optstring or long options (with no short * option equivalents). * * If an element of @a argv starts with '-', and is not exactly "-" * or "--", then it is a short option element. The characters of this * element (aside from the initial '-') are option characters. If * it starts with "--" followed by other characters it is treated as * a long option. If @c operator() is called repeatedly, it returns * each of the option characters from each of the option elements. * * @return The parsed option character. The following characters have * special significance. * @retval 0 A long option was found * @retval '\?' Either an unknown option character was found, or the * option is known but requires an argument, none was * specified, and @a optstring did not contain a leading * colon. * @retval ':' A known option character was found but it requires an * argument and none was supplied, and the first character * of @a optstring was a colon. @c opt_opt() indicates * which option was specified. * @retval '1' @c RETURN_IN_ORDER was specified and a non-option argument * was found. * @retval EOF No more option characters were found. @c opt_ind() will * return the index in @a argv of the first @a argv element * that is not an option. If @c PERMUTE_ARGS was * specified, the @a argv elements have been permuted so that * those that are not options now come last. * * @note The standards are unclear with respect to the conditions under * which '?' and ':' are returned, so we scan the initial characters of * @a optstring up unto the first short option character for '+', '-', * and ':' in order to determine ordering and missing argument behavior. */ int operator () (void);
/** * For communication from @c operator() to the caller. When * @c operator() finds an option that takes an argument, the argument * value is returned from this method, otherwise it returns 0. */ ACE_TCHAR *opt_arg (void) const;
operator()方法有几种返回值:
- 具体的option 字符,表示命令行中遇到了此 short option,或者关联了short option的 long option
- 0,表示遇到了一个long option,可以通过上面介绍的long_option的方法三来获取一下具体是哪一个long option
- EOF,也就是-1,表示解析结束。
当operator()方法返回一个short option或者 long option时,如果这个option可以有参数,则通过opt_arg方法来获取具体的参数。
完整的命令行解析代码
下面展示一个完整但是比较简单的命令行解析的示例程序代码
#include "ace/Get_Opt.h" #include <iostream> using namespace std; int _tmain(int argc, _TCHAR* argv[]) { //定义了三个short option, 类型为 ARG_REQUIRED ACE_Get_Opt opt(argc, argv, "s:u:p:", 1, 0); //定义三个long option, 并与相应的short option 进行关联 if(opt.long_option("server", 's', ACE_Get_Opt::ARG_REQUIRED) == -1) { cout << "server option can not be added" << endl; } if(opt.long_option("user", 'u', ACE_Get_Opt::ARG_REQUIRED) == -1) { cout << "user option can not be added" << endl; } if(opt.long_option("pwd", 'p', ACE_Get_Opt::ARG_REQUIRED) == -1) { cout << "pwd option can not be added" << endl; } //开始解析 for(int arg = 0; (arg = opt()) != -1;) { switch (arg) { case 's': cout << "server is " << opt.opt_arg() << endl; break; case 'u': cout << "user is " << opt.opt_arg() << endl; break; case 'p': cout << "pwd is " << opt.opt_arg() << endl; break; //解析 long option case 0: if(_stricmp(opt.long_option(), "server") == 0) { cout << "long option server is" << opt.opt_arg() << endl; } break; default: break; } } //如果直接运行,或者使用方式不对,则显示帮助 if(opt.opt_ind() + 2 < argc) { cout << "the usage is : GetOptTest.exe [option]" << endl; cout << "\t [-s] server address" << endl; cout << "\t [-u] user name" << endl; cout << "\t [-p] password of user" << endl << endl; } return 0; }
命令行解析实战
至此命令行解析的基本功能已经都介绍了,而且相应的代码已经完成,那么就让我们来欣赏一下解析的效果吧。
具体的命令行
-s 127.0.0.1 --server 192.168.0.1 -u hbccdf -p pwdfortest
运行效果图
其实功能很简单,ACE_Get_Opt的使用也很简单,但是一点点学习,并进行总结,是我一直需要提高的地方。以后会坚持把学到的东西进行总结,并分享到博客上。
坚持!!!