Connection Timeout Expired --> D1IM job service cfg file corrupt

Hi,

This is not for a customer; it's a phenomenon I'm seeing regularly (at least three times a week) in a VMware test lab. The ONLY thing that seems to fix it is to do the following in sequence:

  1. Go into Designer on the database server or an admin workstation, NOT on any other job server:
    1. Edit the job service configuration for every job server
    2. Refresh the account details for EVERY job server in turn
    3. Export each config to a file
    4. Commit the changes to the database
  2. After copying the saved config files onto their respective job servers:
    1. Open each file in Job Service Configurator to check the credentials can still be viewed
    2. Overwrite the existing .cfg file
  3. Reboot the job server.

As soon as the server comes up, it starts working properly. For a day or two. Then it breaks again.

Has anyone else seen this problem or is it just me?!

For example, this morning I had a job service working fine. I provisioned AD accounts to a test user and watched the processes complete in Job Queue Info. This afternoon I repeated the exact same test with a different account with the same job server processing it (no config changes have been made at all in the last 8 hours).

I now have two AdHocProjection entries, one for ADS_ADSAccount_Insert and the other for ADS_ADSGroup_Update, both sitting with execution status TRUE and dependent ScriptExec functions saying FALSE, with dbqueue showing nothing and job queue progress flatlined... for two hours.

The logs give no indication at all that anything's broken. For example, right now the logs say:

<i>2016-12-07 14:29:58 +00:00 - Info: Last process step request succeeded.<x>
<i>2016-12-07 14:31:41 +00:00 - Info: Requesting process steps for queue \SQL1.<x>
<i>2016-12-07 14:31:41 +00:00 - Info: Last process step request succeeded.<x>
<i>2016-12-07 14:31:41 +00:00 - Info: Requesting process steps for queue \SQL1.<x>
<p>2016-12-07 14:31:41 +00:00 - Process step parameter 5BF7E1DA-1959-4250-A980-46CE519C0ACE:
[Job]
    ComponentAssembly=HandleObjectComponent
    ComponentClass=VI.JobService.JobComponents.HandleObjectComponent
    Task=FireGenEvent
    Executiontype=INTERNAL
[Parameters]
    ConnectionProvider=VI.DB.ViSqlFactory,VI.DB
    ConnectionString=Hidden
    EventName=UPDATEUTCOFFSET
    objecttype=DialogTimeZone
    ProcID=A294E707-F699-411B-880A-C881B97B2765
    WhereClause=UID_DialogTimeZone = 'QBM-FF60FBBFC1C18061DF4456004F7B34D2'
    _paramName1=StopTime
    _paramValue1=2200-01-01 00:00:00.000

<x><i>2016-12-07 14:31:41 +00:00 - Info: Last process step request succeeded.<x>
<i>2016-12-07 14:31:41 +00:00 - Info: Loading configuration parameters...<x>
<s>2016-12-07 14:31:41 +00:00 - VI.JobService.JobComponents.HandleObjectComponent - 5BF7E1DA-1959-4250-A980-46CE519C0ACE: Successful
    The event  UPDATEUTCOFFSET was triggered for 1 object(s) of type DialogTimeZone.

<x>
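Since the log itself never reports an error, one thing that might help is measuring how long the service goes quiet between entries. The following is only a rough sketch: it assumes every entry starts with a one-letter tag such as `<i>` followed by a timestamp in the format shown above, and the names `entry_times` and `largest_gap` are made up for illustration.

```python
import re
from datetime import datetime

# Assumption: each log entry looks like "<i>2016-12-07 14:29:58 +00:00 - ...",
# i.e. a one-letter tag, then a timestamp with a UTC offset, then " - ".
STAMP = re.compile(r"<[a-z]>(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2} [+-]\d{2}:\d{2}) - ")


def entry_times(lines):
    """Collect the timestamp of every line that looks like a log entry."""
    times = []
    for line in lines:
        match = STAMP.search(line)
        if match:
            times.append(datetime.strptime(match.group(1), "%Y-%m-%d %H:%M:%S %z"))
    return times


def largest_gap(times):
    """Return (seconds, earlier, later) for the widest gap, or None if there is none."""
    if len(times) < 2:
        return None
    earlier, later = max(zip(times, times[1:]), key=lambda pair: pair[1] - pair[0])
    return (later - earlier).total_seconds(), earlier, later


if __name__ == "__main__":
    sample = [
        "<i>2016-12-07 14:29:58 +00:00 - Info: Last process step request succeeded.<x>",
        "<i>2016-12-07 14:31:41 +00:00 - Info: Requesting process steps for queue \\SQL1.<x>",
    ]
    print(largest_gap(entry_times(sample)))
```

Run against the whole log file, an unusually long gap would at least show when the job service stopped asking for process steps, even though no error was written.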

Eventually a Windows event error is thrown when trying to verify the connection, either by logging into the admin tools from the job server or by verifying the SQL credentials inside Job Service Configurator.

Until the file is fixed you can't even fix the broken Job Service from Job Service Configurator; it complains "the value is encrypted and cannot be displayed" when you go to view the sqlprovider connection string, and afterwards it throws the same error as Job Service Configurator when you go to select the database.


Connection Timeout Expired.  The timeout period elapsed while attempting to consume the pre-login handshake acknowledgement.  This could be because the pre-login handshake failed or the server was unable to respond back in time.  The duration spent while attempting to connect to this server was - [Pre-Login] initialization=28444; handshake=29498;
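To help separate a network problem from a broken config file, a quick check is whether the SQL Server port accepts a plain TCP connection from the job server at all. This is a minimal sketch, not a diagnosis: `probe_tcp` is a made-up helper, the host name `SQL1` is only a guess taken from the queue name in the logs, and 1433 is the SQL Server default port, which may not be what this lab uses.

```python
import socket
import time


def probe_tcp(host, port=1433, timeout=5.0):
    """Try a plain TCP connect; return (True, seconds_taken) or (False, error_text)."""
    start = time.monotonic()
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True, time.monotonic() - start
    except OSError as exc:
        # Covers refused connections, timeouts and name resolution failures.
        return False, str(exc)


if __name__ == "__main__":
    # Assumption: "SQL1" stands in for the real database server name.
    print(probe_tcp("SQL1"))
```

If the port answers quickly while the job service still times out in the pre-login handshake, that would point away from basic network reachability, though it would not prove the config file is at fault.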

The only fix from this point on is to log on to a different job server that is still working, or an admin workstation, and make the change through Designer.

You cannot even stop the One Identity Manager Service once it's in this state: same error code.

Exception occured during Stop.
Database error -2: Connection Timeout Expired.  The timeout period elapsed while attempting to consume the pre-login handshake
...

  • Not sure if this is relevant but in the configurator when I open the file, I can see something very similar to (but not exactly the same as) the connection string in the "max number of pending requests" box. I'd have thought the request view limit should be a number. Even if that is the case, the file validates successfully.

    The actual file shows:

    <value name="connectstring">[P]SNIPPED_THIS_CHUNK1/SNIPPED_THIS_CHUNK2_gAAwAAAABAAAABrryf/nZgZWN+K1LpNvDI6AAAAAASAAACgAAAAEAAAAAOi6bcF8rA1kM7RyOeTYnqoAAAAmiSaCXOnuwo91kJ+3cq9ItNN2xTqCATrbBsIAh5SJyFhhVjBy4NqkzZX97J9Rgg/3CuJMcYF6Um01mavqdxva+FBk2LOHHt+YKIYh8b6lrgqR5hrxKTBYrFn6BM+ikJOS0gNff4SB+tSZaO3KWaQYCVy5xkjLTJZxbC/2ysKKP11IvKpjXw/HO7rOMCyK/kBEzpSrtJ8D1XWZy1hShKY7PxGjSyQyDwwFAAAAJHDxumc7UEx4HM8VTi1J9g3QSL6[P]</value>

    <value name="requestqueuelimit">[P]SNIPPED_THIS_CHUNK1/SNIPPED_THIS_CHUNK2_gAAwAAAABAAAAA3rXGRagfmG7NpEBPRLTgvAAAAAASAAACgAAAAEAAAAHgNxigZkb9P+UZ+9f+53CsQAAAAGakAHkNBpA4Y2f0pP1wp0hQAAAD3/F7pddNfNaRYJUwgx5XdhXsRlw==[P]</value>
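To check every job server's file for the same oddity, a small script could list which settings are stored encrypted and flag any that should probably be a plain number. This is a sketch under several assumptions: the .cfg file is well-formed XML, settings live in `<value name="...">` elements as in the two lines above, encrypted values start and end with `[P]`, and `requestqueuelimit` is supposed to be numeric. `scan_values` and `EXPECTED_NUMERIC` are illustrative names, not part of the product.

```python
import re
import xml.etree.ElementTree as ET

# Assumption: an encrypted value is wrapped in a leading and trailing "[P]".
ENCRYPTED = re.compile(r"^\[P\].*\[P\]$", re.DOTALL)

# Hypothetical list of settings that one would expect to hold a plain number.
EXPECTED_NUMERIC = {"requestqueuelimit"}


def scan_values(xml_text):
    """Return (name, is_encrypted, is_suspicious) for each <value name="..."> element."""
    root = ET.fromstring(xml_text)
    report = []
    for elem in root.iter("value"):
        name = elem.get("name", "")
        text = (elem.text or "").strip()
        encrypted = bool(ENCRYPTED.match(text))
        # Suspicious: a setting expected to be numeric that is not made of digits.
        suspicious = name in EXPECTED_NUMERIC and not text.isdigit()
        report.append((name, encrypted, suspicious))
    return report


if __name__ == "__main__":
    # Shortened stand-in for a real file; the real values are much longer.
    sample = """<configuration>
      <value name="connectstring">[P]abc/def_gAAw==[P]</value>
      <value name="requestqueuelimit">[P]abc/def_gAAx==[P]</value>
    </configuration>"""
    for name, encrypted, suspicious in scan_values(sample):
        print(name, "encrypted" if encrypted else "plain", "SUSPICIOUS" if suspicious else "")
```

Comparing the output for a working job server against a broken one might show whether the encrypted `requestqueuelimit` only appears once the file has gone bad.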