Message boards : News : webserver down
Author | Message |
---|---|
The high number of new hosts attached today killed the webserver. 300+ hosts are trying to get new apps over the slow upload connection. The maximum number of database connections was also reached. I can't do anything at the moment because the bottleneck is the 2 Mbit upload. I will order a 6 Mbit upload next time, but this will not help much. | |
ID: 2685 · Rating: 0 | |
The upload connection is still overloaded. More and more new hosts are attaching from the same user (400+ now). I have decreased max_wus_in_progress to 10 per core and set the RPC time to 20 s. Until all the new hosts have got the apps, the situation will not improve. | |
ID: 2686 · Rating: 0 | |
No worries Reb =) | |
ID: 2687 · Rating: 0 | |
"No worries Reb =)" Thank you, Mankka*. A 40 Mbit connection would be nice and doesn't affect the number of hosts. Sending out work (small files) causes much less traffic than sending the app itself to new hosts. | |
ID: 2688 · Rating: 0 | |
It seems that all the new hosts have the app now. The server is slowly returning to normal, and the settings are now back to the previous ones. | |
ID: 2689 · Rating: 0 | |
Nearly 600 new hosts now. The user is unbelievable. I hope I can add more work soon until the server is faster or we run dry. | |
ID: 2690 · Rating: 0 | |
"Nearly 600 new hosts now. The user is unbelievable. I hope I can add more work soon until the server is faster or we run dry." You actually answered my next question, as I know you have to do it "manually", and when you upload large numbers of new tasks, the connection is also very stressed. But I think it's a happy problem for the project, and if you need some new stuff, don't forget to add it to the ongoing BU campaign ;) | |
ID: 2691 · Rating: 0 | |
"Nearly 600 new hosts now. The user is unbelievable. I hope I can add more work soon until the server is faster or we run dry." The main problem is that most of the hosts are erroring out with disk_limit_exceeded and stressing the network connection, which is causing it to crash, and a restart of the VM alone does not bring it back up. I must now increase max_connections and user_connections in the webserver config due to some MySQL errors. | |
ID: 2692 · Rating: 0 | |
"Nearly 600 new hosts now. The user is unbelievable. I hope I can add more work soon until the server is faster or we run dry." ...ok? But as I told you earlier, make 'em 10 times what is needed, as MySQL seems to sit on "old" connections forever before dropping them, and I haven't figured out how to get around it (kill them fast) :( | |
ID: 2693 · Rating: 0 | |
"Nearly 600 new hosts now. The user is unbelievable. I hope I can add more work soon until the server is faster or we run dry." I have now increased max_connections to 3000 and user_connections to 300, but the warning told me 42000/1200, that's out of the limit ^^ | |
ID: 2694 · Rating: 0 | |
I hope you have sent him a PM if his WUs are erroring out with lack of disk space, kinda odd with SRBase? (If I understood you right?) | |
ID: 2695 · Rating: 0 | |
"I hope you have sent him a PM if his WUs are erroring out with lack of disk space, kinda odd with SRBase? (If I understood you right?)" Yes, I have, and no, this error is on the hosts' side. | |
ID: 2696 · Rating: 0 | |
...well, at some point (and VERY soon PLS) you will have to ban him for a while, as it's messing up the whole project (stalled up/downloads, report problems, BOINC client backing off for 24 hrs...), as it's not nice if he's not serious about crunching the WUs, sorry! | |
ID: 2697 · Rating: 0 | |
"...well, at some point (soon PLS) you will have to ban him for a while, as it's messing up the whole project (stalled up/downloads, report problems, BOINC client backing off for 24 hrs...), not nice if he's not serious & crunch the WUs, sorry!" It's bad timing with all these short WUs, and I don't want to ban him. I have reduced max_wus_in_progress to 20. The number of new hosts is still climbing. BOINC's back-off time is a bit of a mess and should be changed by the devs soon. I have been trying to keep the server alive for a few hours now. Need a break now. We will see how it goes until tomorrow, and I have no idea how many new hosts will be up by that time. There must be an end... | |
ID: 2698 · Rating: 0 | |
I understand you, and I'll babysit a while until I have my caches full ;) | |
ID: 2699 · Rating: 0 | |
I seriously hope Syracuse University has sanctioned 950+ workstations for this project, otherwise somebody is in serious trouble. Just my 2 cents | |
ID: 2700 · Rating: 0 | |
The server has been normalized. New work is incoming. | |
ID: 2706 · Rating: 0 | |
I have now ordered a new internet connection with 400 Mbit download / 25 Mbit upload, but the download is not relevant. The change will be made next week. The download speed from the server will then be 3 MB/s instead of 256 kB/s (for all users). This should reduce the network bottleneck. | |
ID: 2716 · Rating: 0 | |
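
The speeds quoted in the last post follow from dividing the line rate in Mbit/s by 8. Below is a minimal sketch of that arithmetic, plus a rough estimate of how long a saturated uplink needs to send the app to a wave of new hosts. The 10 MB app size and the function names are assumptions for illustration only; the thread does not state the real app size, and protocol overhead is ignored.

```python
def mbit_to_mbyte_per_s(mbit_per_s: float) -> float:
    """Convert a line rate in Mbit/s to MB/s (8 bits per byte, no overhead)."""
    return mbit_per_s / 8.0


def hours_to_serve_app(hosts: int, app_size_mb: float, upload_mbit_per_s: float) -> float:
    """Hours needed to send one copy of the app to every host over a saturated uplink."""
    total_mb = hosts * app_size_mb
    seconds = total_mb / mbit_to_mbyte_per_s(upload_mbit_per_s)
    return seconds / 3600.0


APP_SIZE_MB = 10.0  # hypothetical app size, not taken from the thread

if __name__ == "__main__":
    # 2 Mbit/s is the old uplink, 25 Mbit/s the newly ordered one.
    for uplink in (2, 25):
        rate = mbit_to_mbyte_per_s(uplink)
        hours = hours_to_serve_app(600, APP_SIZE_MB, uplink)
        print(f"{uplink} Mbit/s = {rate:.3f} MB/s, 600 hosts need about {hours:.1f} h")
```

Under these assumed numbers, 600 hosts would need about 6.7 hours at 2 Mbit/s and roughly half an hour at 25 Mbit/s, which is consistent with the uplink, not the server itself, being the bottleneck.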