Agile Availability Issues Update

By agile Administrator | January 31, 2018 | February 21, 2023

You have experienced several Agile outages over the last couple of weeks. Our team is continuing to work on resolving the problem, which is occurring within our core network infrastructure. Agile crashed on January 18th, January 23rd, twice on January 24th, and again this morning (January 31st). In each instance, we contained the issue. We now have a playbook for containing the issue if it happens again, including a monitoring tool that provides “early warning” for this type of event. This morning’s outage was contained to 30 minutes of downtime. We apologize for any inconvenience this may have caused you.

Please find a full explanation of the issues and the corrective action below.

On Thursday, January 18th, at approximately 10 AM EST, the network connection between our application servers and data storage became unstable, causing a full Agile systems outage. It took us four hours to stabilize the network and restore access to Agile. All associates were back on by 2:30 PM. We executed our root cause countermeasure (RCCM) process, which included evaluating data logged from our network switches and reviewing recent changes in our network topology. We concluded that a recent change in our network topology had not been fully “understood” by our core switches, and that their memory needed to be refreshed by recycling (rebooting) each of the switches that make up the core network. The switches had run uninterrupted with 100% uptime since January 2014, so we waited until Sunday to do the reboot to minimize the impact if any switches failed during the process. The Sunday recycle was successful. We felt we had corrected the problem.

On Tuesday, January 23rd, it happened again! This time it occurred at 7 AM EST, and the outage lasted 90 minutes. On Tuesday night, we worked with our vendors to evaluate logs and update switch configurations.

On Wednesday, January 24th, it happened again around 7 AM EST. By then, we had gotten better at containment and were back up in about 30 minutes. Later in the day, around 6 PM EST, the Agile database crashed. It was down for an hour. This crash was due to instability caused by the earlier network outage, so I’m blaming the network again. Our switch vendor identified a memory leak prior to each instability and recommended that we install new firmware on our switches. The installation was completed on Wednesday evening, and we ran stable for 6 days until this morning.

This morning we experienced another outage around 8:30 AM EST, and had Agile back online by 9:00 AM EST. We captured an enormous amount of data before and during the outage, which we are now analyzing with our data storage and network switch vendors. I will keep you posted through this blog on further corrective actions as we learn more.

Regards,

Pat Quinn

SVP, Information Systems and Technology

2 Comments

Nick Pucci says:

January 31, 2018 at 5:42 pm

Is there an approximate timeframe in which Agile will work on browsers other than IE?

Russ Houck says:

February 1, 2018 at 11:47 am

Thanks for all the info, Pat. This must be a little frustrating for the team. Good work! I have complete confidence that it will all be correct and stable very soon.
