Technology Architect
ExternalPrepare for this interview
EliteAI-generated questions, company research, and talking points tailored to this role
Benefits
Additional Information
Platform Operations & Technical Ownership 3rd-Level Technical Support & Troubleshooting as key knowledge resource Acts as the primary 3rd-level contact for: Wazuh SIEM PostgreSQL S3 MinIO Object Storage DNS Infrastructure Remote platform access / bastion systems Linux OS (SuSE, RHEL, Ubuntu) NSX‑T networking and firewalling SuSE Manager Performs deep root-cause analyses including multi-system debugging. Handles cross-team, business-critical incidents requiring broad platform knowledge. Capacity & Performance Management End-to-end responsibility for FCI and Kubernetes cluster capacity management. Continuous assessment of resource utilization, trends, and scaling requirements. Platform Stability & Reliability Drives improvements in platform stability and deployment reliability. Optimizes operational models and CI/CD processes. Ensures smooth transitions from project delivery to stable operations. 2. Platform Engineering & Automation Prepares, designs, and executes Proofs of Concept (PoCs) for: Ansible / AWX to enable automated deployments and configuration management. Oracle-related technologies, including integration and migration scenarios. Develops automation strategies and contributes reusable modules and deployment templates. Defines technical standards for automated operations. 3. Security, Compliance & Governance Audit Management & Collaboration with Auditors Designs, reviews, and explains technical audit controls to internal and external auditors. Coordinates audit activities for both platform and application-related topics. Security-Driven Engineering Embeds security controls into automated deployment workflows. Creates and maintains compliance policies and technical guardrails. Wazuh SIEM Responsibility Designs, maintains, and operates the Wazuh security platform. Develops use cases, alerts, dashboards, and security incident processes. Troubleshoots performance issues, agent behavior, and platform scalability. 4. Collaboration, Stakeholder Management & Enablement Coordinates work packages across AO teams, development teams, and infrastructure units. Works closely with software teams to onboard applications onto the platform. Supports service portfolio development and provides technical input for presales activities. Shares best practices and mentors engineers regarding platform processes and tools. 5. Architecture, Design & Technology Evaluation Executes PoCs and evaluates new platform components. Defines integration strategies for new technologies in alignment with architecture standards. Creates reference architectures, deployment blueprints, and operational concepts. Evaluates solutions based on scalability, resilience, security, and cost efficiency. 6. Project Involvement Project: Icinga Replacement Coordinates work and dependencies with classic AO teams. Supports AO teams in deploying and configuring exporters/agents on legacy VMs. Standardizes client-side configurations and data mappings. Implements standardized dashboards for platform service observability. Defines monitoring and alerting for existing components and applications. Performs advanced troubleshooting, including: missing or incomplete metrics high scrape latency time-series cardinality challenges Kubernetes monitoring (Prometheus Operator, ServiceMonitor/PodMonitor resources) Project: MIF Analysis of the existing application architecture and its components. Conducts PoC for Cognos. Supports DB2 → PostgreSQL migration, including data validation, performance assessment, and migration tooling. 7. Technical Skills & Competencies Linux Platform Engineering & Operations Advanced administration of enterprise-grade Linux systems (RHEL, Ubuntu, hardened distributions). Deep OS-level troubleshooting (CPU, memory, IO bottlenecks, process diagnostics). Service lifecycle management using systemd, including journald log analysis. Kernel parameter tuning, optimization, and performance diagnostics. Host-level incident investigation and forensic log analysis. Definition and execution of patching and lifecycle management strategies. Filesystem operations and troubleshooting (LVM, XFS, ext4, mount and IO issues). User and remote access configuration, including SSH hardening and bastion host concepts. Kubernetes Platform Operations Operational support for Kubernetes clusters across control plane and worker nodes. Troubleshooting pod failures, scheduling issues, container crashes, and resource exhaustion. Debugging of networking-related problems (CNI layers, service routing, DNS resolution). Management of persistent volumes, storage classes, and dynamic provisioning behaviors. Resource forecasting and capacity planning for cluster growth (CPU, memory, storage). Execution and validation of Kubernetes cluster upgrades. Operational support for multi-cluster and multi-environment setups. Analysis of Kubernetes system logs (kube-api, kubelet, controller-manager). Maintenance and enhancement of the Kubernetes stack, including version upgrades an
Your Match
How well this role fits your profile.
Company Intel
What employees say
Worked at T-Systems ICT India Pvt. Ltd.? Share your experience