BrightMove - Application Slowness and Availability – Incident details

Application Slowness and Availability

Resolved
Partial outage
Started 8 months ago
Lasted 2 days

Affected

Power Search and Portal Job Search

Partial outage from 7:38 PM to 4:07 PM

Candidate Experience Portals

Partial outage from 7:38 PM to 4:07 PM

REST API

Partial outage from 7:38 PM to 4:07 PM

BrightMove ATS Application

Partial outage from 7:38 PM to 4:07 PM

Updates
  • Resolved

    We have been monitoring the platform since yesterday's performance hotfix deployment. The engineering and operations teams have concluded that the incident is fully resolved. If you experience any further issues, please contact support.

  • Monitoring

    At 1:45 PM ET, the engineering team deployed a query hotfix to improve the stability and performance of the ATS platform. The team has been actively monitoring the platform for the past 45 minutes, and KPIs show significant improvement. Engineering will continue to monitor the platform.

  • Update

    The engineering team has identified a slow-performing query that is impacting system performance. The development team is working on an optimized rewrite of the query to alleviate the impact. Engineering plans to deploy a hotfix later this afternoon, once the optimized query has been verified. We will update this channel with more details once the hotfix is deployed.

  • Identified

    We are aware of continued performance impact this morning and are actively working on performance improvements. The engineering team has deployed additional infrastructure capacity in an attempt to improve performance while tuning efforts continue. While the service is online and available to users, we are treating this performance impact as a high-severity issue and are working to resolve it as soon as possible.

  • Monitoring

    The engineering team has successfully deployed a performance improvement to the ATS platform. We will monitor the platform for performance and stability until the issue is deemed resolved.

  • Identified

    The engineering team has identified the root cause of the performance bottleneck in the ATS platform and has made a configuration change to improve stability.

  • Investigating

    At this time, we are seeing issues with BrightMove ATS application performance. We are investigating this incident as a severity 1 issue and are working to restore services as quickly as possible.