Version française
Home     About     Download     Resources     Contact us    
Browse thread
OCaml program crashes after computing fine for 2 days during grep on multiMB output file
[ Home ] [ Index: by date | by threads ]
[ Search: ]

[ Message by date: previous | next ] [ Message in thread: previous | next ] [ Thread: previous | next ]
Date: -- (:)
From: Andries Hekstra <andries.hekstra@p...>
Subject: OCaml program crashes after computing fine for 2 days during grep on multiMB output file
Dear OCaml-list,

I use OCaml under 64-bit Linux to do signal processing simulations of next 
generation optical storage devices. So far, I have really enjoyed 
programming in OCaml, e.g. as program texts are considerable shorter than 
in C++ for computations that involve many arrays. My computations run for 
many days if not a week, and produce output files of ca. 20 MB. I run them 
in a job queue. 

Recently I have been plagued by programs that crash when I do a "grep" on 
the output file (opened with open_out). E.g. the program has been running 
succesfully for a few days. I do a "grep @ *.out" in the directory to 
monitor progress as important lines in the output file start with a "@". A 
few minutes later I receive mails from the queuing system saying that 
everything crashed.

What is the cause of these crashes? Can somebody give me a clue?

Thanx,

Andries

------------------------------------------------------------
# LSBATCH: User input
qtb -par Exp107.txt > Exp107.txt.log -codes 
gallager_10b_1023l_1048576w.txt
------------------------------------------------------------

Exited with exit code 2.

Resource usage summary:

    CPU time   : 163606.88 sec.
    Max Memory :      3014 MB
    Max Swap   :      3044 MB

    Max Processes  :         3

The output (if any) follows:

Fatal error: exception Sys_error("Stale NFS file handle")




------------------------------------------------------------------------
Dr. Ir. Andries P. Hekstra
Philips Research 
High Tech Campus 27  (WL-1-4.15)
5656 AG Eindhoven
Tel./Fax/Secr. +31 40 27 42048/42566/44051 
   *  Good open source break software for computer users : 
http://www.workrave.org