6 responses

  1. Rob Ashton
    2018-09-21

    I implemented a system similar to your first concept in a previous role. The only variations I had were that:
    1) I wanted the same job to work whether the instance was an AG cluster or not, and in the case of a non-AG cluster, or a standalone instance, just lump straight to doing the process.
    2) I needed to be able to run the job on versions older than SQL Server 2012, so the checks against the DMVs had to be separated into steps that were called only if the version was high enough to warrant them.

    The job/step green/yellow/red issue vexed me as well, and the only way I saw to improve the situation was to switch over to checking the go/no go decisions and running the process(es) involved in the overall job step as a single PowerShell script. Of course, that wasn’t going to be simple since not all of the instances ran on Windows Server 2008 or higher… *cough* Windows Server 2003/SQL Server 2005 *cough**cough*

    Reply

  2. ikour
    2018-12-17

    what should the privileges of the login be which is being used to run job? Unless it is sysadmin its not giving the same results.

    Reply

    • Wayne Sheffield
      2018-12-18

      Good question. sys.availability_groups requires the “VIEW ANY DEFINITION” permission, and sys.dm_hadr_availability_group_states requires the “VIEW SERVER STATE” permission.
      I typically configure jobs to be owned by “sa”, so they will run with sysadmin.

      Reply

  3. Bill Fritz
    2021-08-02

    Thanks Wayne! Very nice article. The error produced causes our monitoring software to think the job failed even if our Job Step says to Quit Reporting Success. So I had to go another route.
    I also like the ability to use code that doesn’t have to change i.e. Database Name, Listener Name, or even Job Name etc. So I really like what you provided to use a SQL Agent token.

    This gives me a nice clean AG_Test Step 1 that can be added to any SQL Agent Job.

    IF (SELECT primary_replica FROM sys.dm_hadr_availability_group_states) is NULL
    BEGIN
    PRINT ‘This is a not an AG member server. –Not an AG Member Server and Job Execution should continue.’
    END
    ELSE IF (SELECT primary_replica FROM sys.dm_hadr_availability_group_states) = @@Servername
    BEGIN
    PRINT ‘This is a AG Primary Replica. –Do Nothing. This is an AG Primary Replica and Job Execution should continue.’
    END
    ELSE IF (Select primary_replica FROM sys.dm_hadr_availability_group_states) @@Servername
    BEGIN
    PRINT ‘This is a AG Secondary Replica. –Stop Job. This is an AG Secondary Replica and we do not want Job Execution to continue on this Node.’
    DECLARE @job_id UNIQUEIDENTIFIER;
    SET @job_id = CONVERT(UNIQUEIDENTIFIER, CONVERT(VARBINARY(34), $(ESCAPE_SQUOTE(JOBID)), 1));
    EXECUTE msdb.dbo.sp_stop_job @job_id = @job_id;
    END

    The result is that if the Server executing the SQL Agent Job is not an AG member server or is an AG Primary, it does nothing. If it is an AG member server NOT the Primary, it cancels the Job.
    The Job is stopped and the log shows as cancelled. No more error thrown to cause “false” failure alerts.
    Thanks for providing a great explanation and helping me think through this. Hope this Job Step is useful for others in the community.
    Bill

    Reply

    • Wayne Sheffield
      2021-08-13

      Hi Bill,
      I really like what you’ve done here. Looks great!

      Reply

Leave a Reply to let me know how you liked this post

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Back to top
mobile desktop