Very unstable

Hi, I’ve been using Continua CI for some months now with our FinalBuilder script, and Continua works only about two-thirds of the time, which is not something we want from a “Continuous Delivery” system. Just today, two errors occurred:

First one:
"Action has failed due to an error. 
Could not expand query ‘$Agent.FinalBuilder.7.Path$’. The expression ‘$Agent.FinalBuilder.7.Path$’ contains errors: Unknown object : FinalBuilder Unknown object : 7 Unknown object : Path"

Second one:
The agent ‘spade’ which was executing stage ‘Build’ has gone offline. Agent status is Online, Authorized. Agent was last active at 21/03/2014 11:49:34 p.m… Agent communication test failed.

Please tell me how to debug these errors, or at least how to send you some information.

P.S. We are using the latest version 1.0.0.3010 

We’re sorry to hear that you find Continua to be unstable; however, we should be able to resolve these issues after some further troubleshooting.

The second error implies that there is a problem communicating with the agent. The server tries to ping the agent and, if the ping fails, writes the error to the debug log and returns this message. Possible causes are network issues or the agent being too busy to respond. This could in turn prevent the agent properties from being updated, which would cause the first error.
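To rule out basic network problems, a reachability check along these lines can be run from the server machine. This is only a sketch: the function name is made up for illustration, the host and port you pass in are your own agent's settings (none are assumed here), and a successful TCP connection does not prove that the agent service itself is healthy.

```python
import socket


def can_connect(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        # create_connection resolves the host name and opens a TCP socket.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and name-resolution failures.
        return False
```

Calling `can_connect` with the agent's host name and port every few seconds while a build runs would show whether the connection drops at the moment the "gone offline" message appears.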

We’ve had a look through the code and there do seem to be a few areas where we could revise how we recover from agent communication errors, and also where we can reduce communication with the agent. We’ll put some effort into improving these areas for the next build.

Meanwhile, to troubleshoot this issue, can you check whether there are any errors in the Windows Event Log on the agent machine around the time these errors occur? Also check for high CPU or memory usage, which would indicate that the agent is overly busy. If you can enable the debug log for a while on both the agent and the server and, when one of these errors occurs, send a copy of both log files to support@finalbuilder.com, this will help us narrow down the cause of the issue.
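If the debug logs turn out to be large, a small filter like the following can pull out only the entries close to the time of a failure. It is a sketch under a stated assumption: it expects every log entry to start with a timestamp such as `2014-03-21 23:49:34`, which may not match the real log layout, so adjust the two constants as needed. The function name is hypothetical.

```python
from datetime import datetime, timedelta

# Assumption: each log entry starts with a timestamp like "2014-03-21 23:49:34".
# Change these two constants to match the actual layout of the log file.
TIMESTAMP_FORMAT = "%Y-%m-%d %H:%M:%S"
TIMESTAMP_LENGTH = 19


def lines_near(lines, moment, window_minutes=5):
    """Return the lines whose leading timestamp lies within the window around moment."""
    window = timedelta(minutes=window_minutes)
    selected = []
    for line in lines:
        try:
            stamp = datetime.strptime(line[:TIMESTAMP_LENGTH], TIMESTAMP_FORMAT)
        except ValueError:
            # No parseable timestamp at the start of this line, so skip it.
            continue
        if abs(stamp - moment) <= window:
            selected.append(line)
    return selected
```

Note that lines without a leading timestamp, such as the continuation lines of a stack trace, are skipped by this version, so the full log files are still the most useful thing to send.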


 

Thank you, Dave. I enabled logging on both the agent and the server (which, I forgot to mention, run on the same machine). I hope to send you the logs soon.

Regards

OK, today, running the latest version (1.0.0.3092), Continua failed with the following error:

[code]An error occurred while syncing files from the server to the agent.
Details:
Exception: ProcessException
Message: Running C:\Program Files\VSoft Technologies\ContinuaCI Agent\hg\hg.exe with arguments "archive -r fc392369e99570e2a255e2d128327e9ab13d7c17 --subrepos -R C:\CI_WS\Repos\f099a6ee C:\CI_WS\Ws\1986 --config ui.username=Continua --noninteractive" on agent failed with return code 255 and error output: "abort: C:\CI_WS\Ws\1986\Setup.psvince.dll-kzdnab: Invalid access to memory location "
Stack Trace:
   at Continua.Shared.Utils.Mercurial.Run(ProcessArguments args, String workingDir, Func`2 checkResult, Boolean runRecoverIfRequired)
   at Continua.Shared.Utils.Mercurial.Archive(String repository, String destination, String revision, String includes, IEnumerable`1 excludes)
   at Continua.Modules.Builds.Agent.FileSync.AgentRepositoryCache.<>c__DisplayClassd.<Export>b__c()
   at Continua.Shared.Utils.ReadWriteLockList`1.WithReadLock(TId id, CancellationTokenSource cancelTokenSource, Action action)
   at Continua.Modules.Builds.Agent.FileSync.AgentRepositoryCache.Export(String dest, String revision, String includes, IEnumerable`1 excludes, Boolean listFiles)
   at Continua.Modules.Builds.Agent.AgentRepositoryHelper.CopySourceToWorkspace(String filePattern, IEnumerable`1 excludes, String relativeDestination, AgentWorkspaceSyncContext wsCtx, Boolean logRepoFiles)
   at Continua.Modules.Builds.Agent.AgentBuildHelper.SyncSourceFromServer(IEnumerable`1 rules, AgentWorkspaceSyncContext workspaceCtx)
   at Continua.Modules.Builds.Agent.AgentBuildHelper.InitialiseWorkspaceOnAgent(AgentCallbackProxy proxy, TransportContextDTO source, Guid callId)
[/code]

Some information:
- Both the server and the agent are running on the same machine.

I have the logs but would like to send them by email, since I see they contain some sensitive information. Thanks!

Email them to support@finalbuilder.com. In the meantime, try resetting the repository (via the admin menu). The error is caused by Mercurial (which we use for transporting the source between the server and the agent) crashing, which is most likely due to corruption.
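If you want to check for corruption yourself before or after the reset, Mercurial's own `hg verify` command can be run against the cached repository. The wrapper below is only a sketch: the function names are hypothetical, it assumes an `hg` executable is reachable (pass the full path otherwise), and the repository path you give it is your own, for example the one passed to `-R` in the error above.

```python
import subprocess


def hg_verify_command(repo_path, hg_executable="hg"):
    """Build the argument list for `hg verify` against the given repository."""
    return [hg_executable, "verify", "-R", repo_path]


def verify_repository(repo_path, hg_executable="hg"):
    """Run `hg verify` and return (ok, output); ok is True when the exit code is 0."""
    result = subprocess.run(
        hg_verify_command(repo_path, hg_executable),
        capture_output=True,
        text=True,
    )
    return result.returncode == 0, result.stdout + result.stderr
```

A clean result here would not rule out other causes, so resetting the repository via the admin menu remains the first thing to try.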