Reliability Engineer

Long Finch Technologies

Posted On: Jan 08, 2024

Posted On: Jan 08, 2024

Job Overview

Job Type

Contract - W2

Experience

8 - 15 Years

Salary

$70 Per Hour

Work Arrangement

Remote

Travel Requirement

0%

Required Skills

  • Splunk
  • DataDog
  • DynaTrace
  • Reliability
  • PowerShell / Python / Shell
  • Application Performance Management / APM
Job Description

Job Description

  • Develop and maintain comprehensive monitoring solutions for cloud-based services and applications.
  • Configure monitoring tools and systems to collect relevant metrics, logs, and traces.
  • Create custom monitoring dashboards and reports using Splunk, DataDog, DynaTrace or other tools, to provide real-time insights into system performance and health.
  • Continuously monitor the cloud infrastructure's performance and capacity, anticipating and addressing potential scalability issues.
  • Proactively suggest and implement improvements to enhance the system's reliability, resilience, and fault tolerance.
  • Work on automating tasks to streamline operational processes and reduce manual intervention.
  • Collaborate with cross-functional teams to investigate and resolve critical incidents, ensuring minimal impact on end-users.
  • Work with Problem Management team to complete post-mortem analysis of incidents to identify root causes and implement preventive measures.
  • Understand the overall architecture of our systems to identify gaps in monitoring and troubleshoot issues.
  • Configure and maintain custom dashboards and alerts in various monitoring tools.
  • Create custom reports, deliver report presentations to various stakeholders.
  • Develop scripts for monitoring PowerShell, Python, Shell scripting.
  • Develop metrics for both the business and technical teams to determine the health of systems.
  • Provide on-call support as needed.
  • Leads and coordinates performance engineering for medium to large initiatives.
  • Collect and document expected system performance and operational characteristics.
  • Collect and/or prepare test data for test execution.
  • Develop and execute performance tests including load, stress, endurance, fail-over and interoperability.
  • Conduct technical analysis of performance test results and production systems, and provide recommendations on performance tuning, systems, and infrastructure. Identify, report, and review defects in assessing system performance and stability.
  • Defining the strategy for enabling performance diagnostics and monitoring using an Application Performance Management (APM) tool, other monitoring tools, and diagnostic techniques.
  • Collaborating with developers to promote the concept of performance engineering during all phases of the SDLC to detect and correct performance issues earlier in the lifecycle.
  • Leads peer reviews to ensure the completeness of all test assets created.
  • Resolve performance and stability issues in performance test environment.
  • Develop performance engineering work plan structure and project schedule.
  • Review architectural design for performance risks and potential issues.
  • Prepare capacity analysis when applicable.

Job ID: LF240010


Posted By

Andy

HR Manager


Related Jobs
  • Contract - W2
  • Contract - Independent
  • Contract - Corp-to-Corp

  • Company
  • COMPANY

    PB Consulting

  • Company
  • experience

    10 - 20 Years

  • Travel Requirements
  • Work Arrangement

    Hybrid

  • Wallet
  • SALARY

    $70 - $75 Per Hour

  • Skills
  • SKILLS

    • SRE
    • observability
    • Grafana
    • GCP

Posted On: Nov 19, 2024

  • Contract - Corp-to-Corp
  • Contract - Independent
  • Contract - W2

  • Company
  • COMPANY

    PB Consulting

  • Company
  • experience

    7 - 15 Years

  • Travel Requirements
  • Work Arrangement

    On-Site

  • Wallet
  • SALARY

    $58 - $62 Per Hour

  • Skills
  • SKILLS

    • SRE
    • L2 Production Support
    • AWS Lambda
    • ServiceNow

Posted On: Oct 28, 2024

  • Contract - W2
  • Contract to Hire - W2
  • Contract - Independent
  • Contract to Hire - Independent

  • Company
  • COMPANY

    Long Finch Technologies

  • Company
  • experience

    5 - 10 Years

  • Travel Requirements
  • Work Arrangement

    On-Site

  • Wallet
  • SALARY

    Depends on Experience

  • Skills
  • SKILLS

    • Lead

Posted On: Oct 03, 2024

  • Contract - Independent
  • Contract - W2
  • Contract - Corp-to-Corp

  • Company
  • COMPANY

    Techvilla Solutions

  • Company
  • experience

    8 - 14 Years

  • Travel Requirements
  • Work Arrangement

    Remote

  • Wallet
  • SALARY

    $55 - $60 Per Hour

  • Skills
  • SKILLS

    • SRE
    • AWS
    • Linux
    • Cloud
    • +4 more

Posted On: Sep 27, 2024

  • Full-time

  • Company
  • COMPANY

    Long Finch Technologies

  • Company
  • experience

    5 - 8 Years

  • Travel Requirements
  • Work Arrangement

    On-Site

  • Wallet
  • SALARY

    $90,000 - $140,000 Per Year

  • Skills
  • SKILLS

    • Reliability engineer
    • failure analysis
    • TEM
    • JEDEC

Posted On: Sep 23, 2024