Re: [exim] exim delivery process hang at 100% cpu

Author: David Saez Padros
Date:  
To: exim-users
Old-Topics: [exim] exim delivery process hang at 100% cpu
Subject: Re: [exim] exim delivery process hang at 100% cpu
Hi

I had some time to debug this problem, which today seems to be happening
continuously. I was able to reproduce it with an exim process with
debugging enabled and got this output:

.....
updating wait-remote_dk_smtp database
locking /var/spool/exim/db/wait-remote_dk_smtp.lockfile
locked /var/spool/exim/db/wait-remote_dk_smtp.lockfile
EXIM_DBOPEN(/var/spool/exim/db/wait-remote_dk_smtp)
returned from EXIM_DBOPEN
opened hints database /var/spool/exim/db/wait-remote_dk_smtp: flags=O_RDWR
dbfn_read: key=senota.com
dbfn_read: key=senota.com:1867468592
dbfn_read: key=senota.com:1867468591
dbfn_read: key=senota.com:1867468590
dbfn_read: key=senota.com:1867468589
dbfn_read: key=senota.com:1867468588
dbfn_read: key=senota.com:1867468587
dbfn_read: key=senota.com:1867468586
dbfn_read: key=senota.com:1867468585
dbfn_read: key=senota.com:1867468584
dbfn_read: key=senota.com:1867468583
dbfn_read: key=senota.com:1867468582
dbfn_read: key=senota.com:1867468581

and this continues for thousands of lines similar to the last one.
This looks like some kind of integer overflow error, as
/var/spool/exim/db/wait-remote_dk_smtp is just 4 MB in size. Maybe it
is due to the underlying db libraries?
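
To illustrate the suspicion, here is a minimal sketch in Python (not
Exim's actual C code) of how a countdown over "host:N" keys would behave
if the counter stored in the host record were garbage. The function names
and the starting value 1867468593 are assumptions, chosen only so that the
first key matches the log above; whether Exim's loop really has this shape
would need to be checked against its source.

```python
def continuation_keys(host, sequence, limit=None):
    """Yield the keys a countdown from sequence-1 to 0 would look up.

    'sequence' stands in for a counter stored in the host record
    (hypothetical name). 'limit' only caps the output for this demo.
    """
    i = sequence - 1
    emitted = 0
    while i >= 0:
        if limit is not None and emitted >= limit:
            return
        yield f"{host}:{i}"
        emitted += 1
        i -= 1


def lookups_needed(sequence):
    """Number of database reads the countdown performs in total."""
    return max(sequence, 0)


if __name__ == "__main__":
    # A sane record with three continuation entries: three quick reads.
    print(list(continuation_keys("senota.com", 3)))

    # An assumed corrupt counter: the loop itself is correct, but it would
    # issue almost 1.9 billion reads, which looks like a hang at 100% CPU.
    bad_sequence = 1867468593
    print(list(continuation_keys("senota.com", bad_sequence, limit=3)))
    print(lookups_needed(bad_sequence))
```

If this is what happens, a 4 MB file cannot hold that many records, so
the counter itself would be the corrupt part rather than the file size.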


On 12/04/2012 10:23, David Saez Padros wrote:
> Hi
>
> Today exim ate all the server's CPU, to the point that I was unable to
> log in as root. Once it recovered, I saw one exim process eating 100%
> CPU while delivering a message to a remote host
>
> [log]# top
>   PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
>  2855 exim      20   0 42168 2412 1232 R  100  0.0  15:44.75 exim

>
> [log]# /usr/local/exim/bin/exiwhat
> 2853 delivering 1SIDML-0008Ls-OK: waiting for a remote delivery
> subprocess to finish
> 2855 delivering 1SIDML-0008Ls-OK to fltr-in1.mail.dreamhost.com
> [208.97.132.73] (kvmlcq3@???)
>
> This process never ended and I had to kill it, but in the next
> queue run the same message was processed correctly
>



--
Regards, and see you soon ...

----------------------------------------------------------------
    David Saez
    On-Line Services 2000 S.L.
    http://www.ols.es
----------------------------------------------------------------