Maybe setting a max failure percentage for the play would help?

http://docs.ansible.com/ansible/playbooks_delegation.html#maximum-failure-percentage

I've not used maximum-failure-percentage myself so I don't know how easy it 
would be to identify the failed hosts to revisit though.

Jon
On Thursday, July 14, 2016 at 6:03:09 PM UTC+1, Dmitry Makovey wrote:
>
> I'm building a playbook for patching our servers, however I keep on 
> getting 
>
> 2016-07-14 09:30:29,186 p=55840 u=dimon |  PLAY [report] 
>> ****************************************************************** 
>> 2016-07-14 09:30:29,211 p=55840 u=dimon |  ERROR! invalid host (
>> somerandomhost1.stanford.edu) specified for playbook iteration
>> 2016-07-14 09:33:06,935 p=65806 u=dimon |   [WARNING]: provided hosts 
>> list is empty, only localhost is available
>
>
> Here's a Gist of the playbook 
> https://gist.github.com/droopy4096/72cabebf90ba76b009d128d7dd82eef2 
>
> In a nutshell: I'm using our server inventory DB (Pakiti) to extract list 
> of hosts registered (wrote my own module for that). Then I walk through 
> those hosts and select ones that are "alive" according to Pakiti into 
> "pakiti_hosts" group. Then we have Facter facts on machines identifying 
> their patching priority, so I do a round of "facter fact gathering" (which 
> fails for some machines) and where it fails I set patch priority to 0. Then 
> I'd like to execute certain set of commands across all those pakiti_hosts 
> (which at present is a mere template being generated for report), but my 
> playbook intermittently fails due to some hosts either not responding or 
> some other things and I have to re-launch entire playbook from start. 
>
> It seems that Ansible is OK with connection failures and skipping over 
> those hosts, however for some reason some of the other errors lead to the 
> above message. I've tried to work around this by introducing "blacklist" 
> group in playbook, but when working with 500+ machines - there's always one 
> or two that would fail. I'd like to complete the execution and revisit 
> those boxes later. I've tried "ignore_errors" but I'd rather not add it to 
> *every* block. What are my options? 
>
> I realize that I could collect hosts via dynamic inventory as well but 
> this way it seemed more natural to me, leveraging ansible facilities for 
> that. 
>
> In other words: how can I make my playbook more resilient? 
>
>
>

-- 
You received this message because you are subscribed to the Google Groups 
"Ansible Project" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/ansible-project/9ff51860-8799-40af-b67e-4e6b2fe1e265%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to