Data Catalogue Lead
Data Science, Data Catalogue, Pharmaceutical, Data Analysis, Big Data
SQL, SaaS, Collibra
About Our Client
My client, a global leader in Pharmaceuticals have recently invested heavily in how the business and it's clients use data and need a leader to help grow the team.
As the Data Catalogue Lead you will be tasked with building a new team that will allow new internal and external data to be found using Collibra. You will create a scalable solution to support the development of data communities and valuable data products. To make this solution scalable we need to provide the services and tools to our partners in the R&D in a secure, compliant, stable and sustainable way.
As the Data Catalogue Lead you will own the build and run of the data catalogue as a capability for R&D and IT. This will involve building out the people, processes and technology to meet R&D product owner requirements and IT's architectural strategy. You will build the data catalogue operating model and work with solution architecture to create a technology roadmap. You will be accountable for the efficient, secure and complaint running of the catalogue.
They have chosen Collibra as the data catalogue technology. Our teams will be making use a range of data engineering products to acquire, ingest and curate metadata into the data catalogue (including Talend and AWS Glue). Your team will be supporting the cataloguing of a wide variety of data sources including: Omics, Imaging, clinical study, DMTA cycle systems, AI/ML model outputs, literature, sensor data, and external data sources. This will include implementing metadata models, building governance workflows, automating granting of access and building out APIs. You will have a close working relationship with our Metadata Lead and Information Architects, as well as the Data Lake Lead.
You will join a team that has delivered cloud solutions, such as the development of auto-scaling containerised ETL. Similarly, we have built an automated ETL test harness which integrates with our evolving CI/CD processes. You will need a collaborative delivery approach to be successful. We prefer to use Agile but choose the appropriate approach for the project. So, experience of a variety of delivery management methodologies will come in useful. You will provide technical leadership throughout our software development lifecycle, from the initial development of a technical design based on a blueprint, right through to hypercare. Do you have a real passion for delivering well engineered data and analytics solutions that can help improve patient lives?
The Successful Applicant
You will have experience of building or developing team,
Technical leadership in a data domain,
You will be able to demonstrate an ability to understand business needs and translate them into a solution,
You will be able to design and document development best practices,
You will need great interpersonal skills & a collaborative approach to delivery.
It is highly desirable that you have experience developing and managing a data catalogue or similar
Experience of big data, ETL & cloud techniques and tools (we currently use Talend. Redshift (inc. Spectrum), Glue, EMR, HIVE, Spark, S3, SQS, SNS)
Experience configuring and managing a SaaS system,
What's on Offer
My client are offering a fantastic package, flexible working and the opportunity to build a new team within a global organisation.