将ROS从cmm03刀片上转移到cmm02刀片上
1. 在 ../segments/dybnewfarm.data.xml 文件里添加cmm02上节点的描述
2. 在 ../segments/simulation_FADC_xml/wu8.xml 文件里将ROS运行的节点改成cmm02节点
3. 在../segments/FADCEthChan.data.xml 里增加cmm02 节点上的FADCChannel
4. 在 ../segments/simulation_FADC_xml/wu8.xml 上添加cmm02上的FADCChannel
干脆将上述文件都复制一份再修改,文件名加上cmm02的下标,修改include的文件。
然后将part_dk_ef.data.xml里include的其他文件里面的cmm03替换为cmm02.
在cmm02node01上启动取数界面后,点击boot,initial,发生错误:提示系统资源不够。
Write failed: Broken pipe^M ERROR 2017-Mar-07 08:17:09 [void daq::rc::Application::launch(...) at RunController/src/lib/Application.cc:788] Application L2SV-1 could not be started. was caused by: ERROR 2017-Mar-07 08:17:09 [void daq::rc::Application::launch(...) at RunController/src/lib/Application.cc:759] Application L2SV-1 could not be started. Error while trying to start L2SV-1 on host cmm02node01: Error while trying to start the thread reading on the report FIFO for process L2SV-1 in partition part_dk_ef: boost thread: failed in pthread_create: Resource temporarily unavailable. Application control lost!!! was caused by: ERROR 2017-Mar-07 08:17:09 [daq::pmg::Process* daq::pmg::ProcessDescriptionImpl::start(...) at ProcessManager/src/lib/ProcessDescriptionImpl.cxx:376] Error while trying to start L2SV-1 on host cmm02node01: Error while trying to start the thread reading on the report FIFO for process L2SV-1 in partition part_dk_ef: boost thread: failed in pthread_create: Resource temporarily unavailable. Application control lost!!! Write failed: Broken pipe^M ERROR 2017-Mar-07 08:17:36 [static ers::Issue* daq::OnlRec::ERSExtension::createIssue(...) at OnlineRecovery/src/ERSExtension.cpp:106] Raised Error state because of "L2SV-1" APPLICATION_FAILED
[lhaaso@cmm02node01 ~]$ ulimit -a core file size (blocks, -c) 0 data seg size (kbytes, -d) unlimited scheduling priority (-e) 0 file size (blocks, -f) unlimited pending signals (-i) 257480 max locked memory (kbytes, -l) 64 max memory size (kbytes, -m) unlimited open files (-n) 1024 pipe size (512 bytes, -p) 8 POSIX message queues (bytes, -q) 819200 real-time priority (-r) 0 stack size (kbytes, -s) 10240 cpu time (seconds, -t) unlimited max user processes (-u) 1024 virtual memory (kbytes, -v) unlimited file locks (-x) unlimited
用ps -ef|wc -l 查看系统的进程数,没有超过1024。
[lhaaso@cmm02node01 ~]$ ps -ef |wc -l
550
[lhaaso@cmm02node01 ~]$ ps -ef |wc -l
627
[lhaaso@cmm02node01 ~]$ ps -ef |wc -l
-bash: fork: retry: Resource temporarily unavailable
-bash: fork: retry: Resource temporarily unavailable
628
[lhaaso@cmm03node01 ~]$ ulimit -a core file size (blocks, -c) 0 data seg size (kbytes, -d) unlimited scheduling priority (-e) 0 file size (blocks, -f) unlimited pending signals (-i) 257490 max locked memory (kbytes, -l) unlimited max memory size (kbytes, -m) unlimited open files (-n) 32768 pipe size (512 bytes, -p) 8 POSIX message queues (bytes, -q) 819200 real-time priority (-r) 0 stack size (kbytes, -s) 10240 cpu time (seconds, -t) unlimited max user processes (-u) 32768 virtual memory (kbytes, -v) unlimited file locks (-x) unlimited
在cmm03上启动一次界面,作为对比,取数界面从启动到点击INITIALIZE增加了76个进程。
[lhaaso@cmm03node01 ~]$ ps -ef |wc -l //未启动界面时 661 [lhaaso@cmm03node01 ~]$ ps -ef |wc -l // 界面启动后 729 [lhaaso@cmm03node01 ~]$ ps -ef |wc -l //点击了BOOT后 733 [lhaaso@cmm03node01 ~]$ ps -ef |wc -l // 点击了INITIALIZE后 737
修改limits.conf文件(修改为cmm03刀片一样)增加系统允许运行的进程数量,修改后,取数界面可以正常。
现在有一个问题,修改这个文件里的参数会不会对系统的性能有影响呢?
* soft memlock unlimited * hard memlock unlimited lhaaso soft nofile 32768 lhaaso hard nofile 65536 lhaaso soft nproc 32768 lhaaso hard nproc 65536 zengtx soft nofile 32768 zengtx hard nofile 65536 zengtx soft nproc 32768 zengtx hard nproc 65536
修改limits.conf后,界面正常启动,系统的进程数增加了74个。
[lhaaso@cmm02node01 tdaq-04-00-01-juno]$ ps -ef|wc -l // 未启动界面时 504 [lhaaso@cmm02node01 partitions]$ ps -ef|wc -l //启动界面后 571 [lhaaso@cmm02node01 partitions]$ ps -ef|wc -l //点击boot后 575 [lhaaso@cmm02node01 partitions]$ ps -ef|wc -l // 点击INITIALIZE后 578