Support BI Data Architecture across all clients and disciplines, focusing on technology, processes, and tools, and ensuring that the entire BI team is technically equipped to perform its work. Main responsibilities include:
Design and execution of modern data architectures:
- Development of data ecosystems using tools such as Snowflake and SQL Server, ensuring scalability, performance, and security.
- Creation of advanced ETL pipelines to centralize, harmonize, and categorize data from heterogeneous sources (Data Warehouse, marketing systems, Digital Media APIs, and others that can be integrated).
- Implementation of Data Mesh-based architectures to promote decentralization of data access.
Advanced cloud platform management:
- Optimization of cloud platforms (AWS, Azure, Snowflake) for distributed data processing, reducing costs and increasing efficiency.
- Intensive use of Snowflake for managing large data volumes with features like data sharing, time travel, and multi-cluster warehouses.
Automation and APIs for real-time integration:
- Development and consumption of RESTful and GraphQL APIs to enable real-time integration flows between internal and external systems.
- Automation of deployments and data pipelines using tools such as GitHub Actions and Terraform.
Data governance and security:
- Definition of governance policies.
- Oversight of processes to ensure data quality and regulatory compliance.
Leadership and strategic collaboration:
- Coordination of technical teams on high-impact projects, ensuring alignment with business objectives.
- Fostering a data-driven culture through training and the establishment of best practices in the use of cloud platforms and BI tools.
Support Areas:
- Snowflake: Management of Snowflake-related requests (access, new schemas, permissions, etc.).
- Remotes (Azure, AWS): Azure and AWS support is handled with the Regional Cloud Engineering IT team. Initial contact: Danilo Ho, Senior Systems Administrator. Final escalation: Emma Gutierrez, Senior Manager, Cloud Engineering. For incident resolution, first submit a ticket by email to ESD.
- Alteryx: Support for activation and use of Alteryx licenses with the Regional Cloud Engineering IT team. Initial contact: Danilo Ho, Senior Systems Administrator, for application support and Alteryx server Gallery access. For support requests and new requests, please submit a ticket by email to ESD.
- GitHub: To request additions or permission changes in the LA-MBW-BI_ENGINEERING GitHub repository, contact Humberto Oballe, IT Developer Manager LATAM. Contact on the AWS Cloud Engineering IT Regional team: Carlos Villacreses, Senior Cloud Engineer.
Tasks:
Snowflake:
- Support connections and resolve issues that arise in the workflows managed by LATAM Insights & Analytics team members (current leader: Claudia Vargas). Also manage updates to application items, monitor the schemas and tables that store information, and manage and supervise Snowflake connections.
- Design and configure database structures, schemas, roles, and access policies.
- Evaluate updates or new Snowflake capabilities, such as data sharing, unstructured data support, or the use of Snowpark.
- Administer ODBC connections for the remotes connected to Snowflake.
- Help resolve incidents related to workloads, slow queries, or access issues.
- User management: create, modify, and revoke user access, setting appropriate roles and permissions based on the least privilege principle.
- Monitor failures caused by misconfiguration when creating new tables in the GR_KINESSO database, which consists of several schemas created and used depending on the client.
- Technical advice: Guide team members in optimizing SQL queries, using advanced features like time travel and cloning, and following Snowflake best practices.
- Ensure data integrity (information security).
- Data modeling: Assist in creating efficient data models for reports and analysis.
- Automation: Configure and optimize data pipelines using tools like Snowpipe or integrations with ETL processes.
- Query optimization: Identify inefficient queries and collaborate to improve them, reducing costs and improving performance.
- Supervise permissions granted on the different schemas, primarily MBLA_AGENCY_LA and MBLA_NESTLE_LA.
- Ensure that the architecture is designed to handle growth in data volume and users.
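As an illustration of the least-privilege user management described in this list, the following minimal sketch builds the Snowflake GRANT statements for a read-only role on a single schema. The role name `ANALYST_RO` is hypothetical, and pairing the schema MBLA_AGENCY_LA with the GR_KINESSO database is an assumption made only for the example; the helper itself is not part of any existing tooling.

```python
import re
from typing import List

# Unquoted Snowflake identifiers: a letter or underscore, then letters, digits, "_" or "$".
_IDENTIFIER = re.compile(r"^[A-Za-z_][A-Za-z0-9_$]*$")


def _check_identifier(name: str) -> str:
    """Reject anything that is not a plain identifier, to avoid SQL injection."""
    if not _IDENTIFIER.match(name):
        raise ValueError(f"Invalid identifier: {name!r}")
    return name


def read_only_grants(database: str, schema: str, role: str) -> List[str]:
    """Return the GRANT statements giving `role` read-only access to one schema."""
    database = _check_identifier(database)
    schema = _check_identifier(schema)
    role = _check_identifier(role)
    qualified_schema = f"{database}.{schema}"
    return [
        f"GRANT USAGE ON DATABASE {database} TO ROLE {role};",
        f"GRANT USAGE ON SCHEMA {qualified_schema} TO ROLE {role};",
        f"GRANT SELECT ON ALL TABLES IN SCHEMA {qualified_schema} TO ROLE {role};",
        f"GRANT SELECT ON FUTURE TABLES IN SCHEMA {qualified_schema} TO ROLE {role};",
    ]


if __name__ == "__main__":
    # Hypothetical role name; the database/schema pairing is assumed for illustration.
    for statement in read_only_grants("GR_KINESSO", "MBLA_AGENCY_LA", "ANALYST_RO"):
        print(statement)
```

The statements are only generated, not executed, so they can be reviewed before being run in a worksheet or deployment script.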
Remotes (Azure, AWS):
- Support the operation of these remote machines.
- Ensure that each remote has the minimum tools its assigned user needs to operate.
- Manage connections to different data sources.
- Manage applications used for user task fulfillment.
Alteryx:
- Alteryx licenses: Ensure that Alteryx licenses are allocated to team users. If a new license is required, manage the request as needed.
- Keep versions updated to ensure compatibility with the Alteryx server (Gallery).
- User management: Configure roles and permissions for users, ensuring they have appropriate access to workflows and data.
- Monitor the Alteryx server (Gallery): Oversee server performance, disk space, and resources to avoid bottlenecks.
- Updates and patches: Coordinate the installation of new versions or updates to leverage improvements and new features.
- Troubleshoot workflow errors: Assist in diagnosing and resolving issues with workflows that don't work properly.
- Workflow optimization: Identify and suggest improvements to optimize workflow efficiency and execution time.
- Technical assistance: Provide support to users to resolve issues related to specific tools or configurations.
- Scheduling: Help schedule workflows to run automatically on Alteryx Server.
- Validate results: Ensure processed data meets expected standards and is error-free.
- Integration with other tools: Facilitate Alteryx connection with external systems (databases, cloud platforms, APIs, etc.).
- Review workflows: Identify redundant workflows or those that could benefit from best practices.
- Auditing: Perform regular reviews to ensure compliance with corporate standards, security, and quality.
- Innovation: Propose new ways to leverage Alteryx to solve problems or automate processes.
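The disk-space part of the server monitoring task above could be sketched as follows. This is a minimal example using only the Python standard library; the 85% threshold and the path being checked are assumptions for illustration, not values taken from the Alteryx server configuration.

```python
import shutil


def disk_usage_percent(path: str) -> float:
    """Return the percentage of disk space used on the volume containing `path`."""
    usage = shutil.disk_usage(path)  # named tuple: total, used, free (bytes)
    if usage.total == 0:
        return 0.0
    return 100.0 * usage.used / usage.total


def is_disk_low(used_percent: float, threshold_percent: float = 85.0) -> bool:
    """Flag the volume when usage reaches the (assumed) alert threshold."""
    return used_percent >= threshold_percent


if __name__ == "__main__":
    # "." is a placeholder; point this at the volume that holds the server's data.
    percent = disk_usage_percent(".")
    status = "LOW DISK SPACE" if is_disk_low(percent) else "ok"
    print(f"{percent:.1f}% used: {status}")
```

Run on a schedule, a check like this gives early warning before the server reaches a bottleneck.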
GitHub:
- Implement rules for key branches, such as requiring code reviews before merge or triggering automatic tests.
- Define conventions for branch names, commits, and pull request (PR) messages.
- Configure GitHub Actions or similar integrations to run tests, verify linting, or deploy data pipelines automatically.
- Manage workflows for deploying solutions in development, test, and production environments.
- Monitor workflow executions to detect recurring errors and resolve them quickly.
- Add and remove collaborators based on project needs.
- Delete obsolete branches and ensure the repository doesn't accumulate unnecessary or inefficient code.
- Ensure tokens, API keys, and sensitive credentials are managed outside the repository (e.g., using GitHub Secrets).
- Review repository activity logs for unauthorized access or insecure practices.
- Assist developers with their first tasks, answering questions about the environment or tools.
- Help resolve complex merge conflicts and provide strategies to avoid them in the future.
- Provide preconfigured scripts or tools to help developers with repetitive tasks, such as setting up local environments or validating code.
- Be available to the team via meetings, chats, or tickets, ensuring that questions are resolved quickly.
- Help developers correctly configure their local environments so they can quickly start working with the repository code.
- Validate and adjust developer roles to ensure necessary access without compromising repository security.
- When a workflow fails, investigate the error, adjust the workflow configuration, and offer recommendations to avoid similar problems.
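One way to enforce a branch naming convention like the one mentioned in this list is a small validation script that a GitHub Actions job or a local hook could call. The sketch below assumes a hypothetical convention of `<type>/<lowercase-words-separated-by-hyphens>` with the types `feature`, `bugfix`, `hotfix`, and `release`; the actual convention is for the team to define.

```python
import re

# Hypothetical convention: "<type>/<kebab-case-description>".
BRANCH_PATTERN = re.compile(
    r"^(feature|bugfix|hotfix|release)/[a-z0-9]+(?:-[a-z0-9]+)*$"
)


def is_valid_branch_name(name: str) -> bool:
    """Return True when `name` follows the assumed branch naming convention."""
    return BRANCH_PATTERN.fullmatch(name) is not None


if __name__ == "__main__":
    for candidate in ["feature/snowflake-loader", "Feature/Loader", "fix_stuff"]:
        verdict = "ok" if is_valid_branch_name(candidate) else "rejected"
        print(f"{candidate}: {verdict}")
```

Long-lived branches such as `main` would need to be exempted or added to the pattern, depending on where the check runs.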
Development Team Support (API developers and engineers)
Direct Support
- Coordination: Ensure that assigned tasks are completed within the established deadlines.
- Technical Support: Provide direct assistance in resolving technical issues, Python code errors, or workflow failures.
- Facilitation: Ensure they have the necessary tools and an appropriate work environment.
- Supervision: Monitor task progress and ensure operational continuity in case of absences.
- Communication: Direct, via Microsoft Teams:
  - Group: A team channel to discuss general progress, flag blockers, or share project-related information.
  - Individual: To address specific matters or resolve technical issues.
Team Responsibilities
The team must:
- Complete Tasks: Execute and supervise data ingestion and processing pipelines.
- Report Progress: Share a brief daily report in Teams for visibility and tracking.
- Document Knowledge: Keep key information about tasks and processes updated in the repository or assigned tools.
- Escalate Issues: Immediately report any bugs, blockers, or critical errors to receive support.
Availability of Tools
- Tool Management: Ensure the team has:
  - Remote or virtual machines, if needed.
  - Updated Alteryx or other required software licenses.
  - Access to the GitHub repository and other platforms.
- Local Configurations: Assist in setting up local development environments (Python, libraries, connectors).
Daily Reporting Procedure
To maintain visibility of progress, developers should send a daily message in Teams (group or direct, as applicable) including:
- Completed Tasks: A brief summary of completed work.
- Pending Tasks: Activities in progress and missing steps.
- Blockers: Problems requiring immediate attention or support.
Supervision and Monitoring
- Daily Monitoring: Review submitted reports and updates in GitHub repository issues.
- Tool Audits: Periodically check that assigned tools are working and available for the team.
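To keep the daily Teams message consistent across developers, the three parts named in the reporting procedure (completed tasks, pending tasks, blockers) could be assembled with a small helper. This is only a sketch: the function name and the plain-text layout are assumptions, and posting the text to Teams is left out.

```python
from typing import List


def _section(title: str, items: List[str]) -> List[str]:
    """Render one report section, writing "None" when there is nothing to list."""
    body = [f"- {item}" for item in items] or ["- None"]
    return [f"{title}:"] + body


def format_daily_report(
    completed: List[str], pending: List[str], blockers: List[str]
) -> str:
    """Build the daily report text in the order used by the reporting procedure."""
    lines = (
        _section("Completed Tasks", completed)
        + _section("Pending Tasks", pending)
        + _section("Blockers", blockers)
    )
    return "\n".join(lines)


if __name__ == "__main__":
    # Example entries are invented for illustration.
    print(
        format_daily_report(
            completed=["Loaded media API data"],
            pending=["Validate row counts"],
            blockers=[],
        )
    )
```

A fixed layout also makes the daily review quicker, because blockers always appear in the same place.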