Message boards : Bug tracker : Infinite loop WU ?
Author | Message |
---|---|
Laurent Lyaudet Joined: 22 Mar 23 Posts: 32 Credit: 233,667,934 RAC: 1,677,622 |
Hello, The open_mt tasks usually take between 1 hour 30 minutes and 2 hours on my laptop. But today I had one that was stuck at 99.921% for 3 hours. I stopped BOINC and restarted it; after a few seconds the completion percentage was down to 74%. I waited another hour, and after 4 hours in total the task was stuck again at 99.921%. So I suspect there may be an infinite loop. Here is the WU: https://sech.me/boinc/Amicable/workunit.php?wuid=27069146 Here is the task: https://sech.me/boinc/Amicable/result.php?resultid=62461556 Here is the PC: https://sech.me/boinc/Amicable/show_host_detail.php?hostid=201681 I can help reproduce it late tomorrow afternoon. Nothing fancy with my laptop, no overclocking; I'm using the latest Ubuntu, 23.04. I hope we can find the bug soon and fix it :) Best regards, Laurent Lyaudet |
Sergei Chernykh Project administrator Project developer Joined: 5 Jan 17 Posts: 534 Credit: 72,451,573 RAC: 0 |
I tested this WU on both Windows and Linux CPU versions, and it didn't get into an infinite loop. Windows: 11:00:11 (7588): Can't open init data file - running in standalone mode Linux:
It took around 35-36 minutes on a 7950X, so it should take no more than 5-6 hours even on a weaker CPU. It's one of those WUs that are better suited to the GPU application. |
Laurent Lyaudet Joined: 22 Mar 23 Posts: 32 Credit: 233,667,934 RAC: 1,677,622 |
Hello :) Thanks for your answer. Maybe it was not an infinite loop, but I did get an error, as logged in the task: <core_client_version>7.20.5</core_client_version> <![CDATA[ <message> process exited with code 193 (0xc1, -63)</message> <stderr_txt> Factorization '0' is incorrect: 0 is not a prime SIGABRT: abort called Stack trace (10 frames): [0x437140] [0x457be0] [0x457ac9] [0x52b2ea] [0x405696] [0x4067c0] [0x4009a4] [0x51ab66] [0x51ad5a] [0x426549] Exiting... </stderr_txt> ]]> The error stopped the computation 20 minutes ago, after 4 hours 13 min 23 sec of computing. I hope you will be able to tell whether the error is due to my laptop, a bug in the code, or a random error. Thanks, best regards, Laurent Lyaudet |
Sergei Chernykh Project administrator Project developer Joined: 5 Jan 17 Posts: 534 Credit: 72,451,573 RAC: 0 |
"Factorization '0' is incorrect: 0 is not a prime": this error means that your checkpoint file is corrupt. You should manually delete it from the working folder; the file name is "amicable_checkpoint". |
Laurent Lyaudet Joined: 22 Mar 23 Posts: 32 Credit: 233,667,934 RAC: 1,677,622 |
Thanks :) I'll know what to do if it happens again. My laptop switched to another task after that one ended with an error. Just a reference for myself: the amicable_checkpoint file is found in two directories: /var/lib/boinc/slots/0/ and /var/lib/boinc/slots/1/ Best regards, Laurent Lyaudet |
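
For anyone hitting the same error, the manual cleanup described above could be scripted roughly as follows. This is only a sketch: the file name `amicable_checkpoint` and the `/var/lib/boinc/slots` location come from this thread, while the helper names `find_checkpoints` and `remove_checkpoints` are made up for illustration. It assumes the BOINC client has been stopped first, and that you only want to remove the checkpoint in the slot of the failing task, since a task whose checkpoint is deleted presumably cannot resume from it.

```python
from pathlib import Path

# File name given in the thread; the helpers below are hypothetical.
CHECKPOINT_NAME = "amicable_checkpoint"


def find_checkpoints(slots_root):
    """Return checkpoint files found directly inside each slot directory."""
    root = Path(slots_root)
    if not root.is_dir():
        return []
    return sorted(p for p in root.glob(f"*/{CHECKPOINT_NAME}") if p.is_file())


def remove_checkpoints(paths, dry_run=True):
    """Delete the given files and return them.

    With dry_run=True (the default) nothing is deleted; the paths that
    would be removed are returned so they can be reviewed first.
    """
    handled = []
    for path in paths:
        path = Path(path)
        if not dry_run:
            path.unlink()
        handled.append(path)
    return handled
```

A cautious way to use it is to call `find_checkpoints("/var/lib/boinc/slots")` first, pick the path belonging to the failing task, and only then pass that single path to `remove_checkpoints([...], dry_run=False)`. On Ubuntu this will likely need the same permissions as the BOINC service user.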
©2024 Sergei Chernykh