24 Oct
LexisNexis Intellectual Property Solutions
United Kingdom
Senior Data Scientist - GenAI enablement team
London/ remote
About our Team
The LexisNexis Intellectual Property (IP) division (https://www.lexisnexisip.com) provides international patent content and a suite of online and analytic tools that meet the evolving needs of the intellectual property market. We deliver data to support LexisNexis IP search and analytics applications, empowering our customers with actionable insights and metrics for critical business decisions.
Our corporate culture thrives on excellence, innovation, and a strong dedication to our customers, employees, and communities. Working here means joining a vibrant, diverse, and collaborative team where you are free to grow and contribute actively.
About the Role
We are seeking a Senior GenAI/Data Scientist to join our AI Innovation Team. This role will focus on experimenting, optimising and applying Generative AI off the shelf models to extract valuable insights from large-scale patent datasets and enhance our search and analytics tools. You will collaborate with data scientists, product teams, and stakeholders across different geographies, driving innovation through LLMs (Large Language Models) and advanced AI methodologies.
Responsibilities:
- Break down complex business problems into actionable AI solutions, leading the design and development of Generative AI models.
- Work closely with cross-functional teams to identify key areas for AI-driven innovation in patent search and analytics applications.
- Collaborate with the team to explore Generative AI use cases, including automated summarisation, natural language understanding, and text generation.
- Ensure solutions are scalable, maintainable, and aligned with best practices in machine learning.
- Work on GenAI techniques like Prompt Engineering, RAG (Retrieval-Augmented Generation) and perform evaluation using frameworks to optimise LLM performance.
- Develop and implement machine learning workflows, focusing on the integration of GenAI with existing data infrastructure.
- Perform continuous evaluations and improvements of models to handle large volumes of patent data.
- Collaborate with data engineers and data scientists to integrate AI models seamlessly into the broader data architecture.
- Provide mentorship and coaching to junior team members, fostering a learning culture within the team.
Requirements:
- Demonstrate 4+ years of experience in data science, with a focus on NLP, Generative AI and LLMs.
- Proficiency in Python and experience working with LLMs and NLP frameworks (e.g. Hugging Face, Spacy, Pytorch/Tensorflow etc).
- Experience with Prompt Engineering, RAG techniques and various evaluation methodologies for integrating GenAI with search/retrieval systems and measure the quality.
- Experience with LangChain / LlamaIndex, vector databases (e.g., FAISS), fine-tuning models on domain-specific data.
- Experience working with cloud platforms like Azure, AWS, or GCP for machine learning workflows.
- Understanding of data engineering pipelines and distributed data processing (e.g., Databricks, Apache Spark).
- ·Strong analytical skills,
with the ability to transform raw data into meaningful insights through AI techniques.
- Experience with SQL, ETL processes, and data orchestration tools (e.g. Azure Data Factory, Talend) will be an advantage
▶️ Senior Data Scientist
🖊️ LexisNexis Intellectual Property Solutions
📍 United Kingdom