Open in app

Sign In

Write

Sign In

Nick Sorros
Nick Sorros

139 Followers

Home

About

Published in MantisNLP

·Mar 1

Eight examples is all you need ✨

It is no secret that the power of machine learning models comes from the data that they are trained on. It should come as no surprise then that the most common question we get at the start of any project we take here at Mantis is “How much data is…

Naturallanguageprocessing

6 min read

Eight examples is all you need ✨
Eight examples is all you need ✨
Naturallanguageprocessing

6 min read


Published in MantisNLP

·Nov 9, 2022

Speed up 🏎️ data annotation with active learning ✨

Access to high quality annotated data remains one of the biggest limiting factors preventing companies from progressing an AI use case into a working implementation. This is despite great advances in few shot learning and pre-trained models which lower the barrier to entry and reduce the amount of annotated data…

Active Learning

5 min read

Speed up 🏎️ data annotation with active learning ✨
Speed up 🏎️ data annotation with active learning ✨
Active Learning

5 min read


Published in MantisNLP

·Sep 28, 2022

Match text from different domains using language models 📑

Organizations with a large collection (corpus) of documents often need to find similar documents. For a grant-giving organization (for example a charitable foundation like the Wellcome Trust), the use case could be finding experts to review grant applications by matching the expert’s publication history to the grant application. For a…

Language Model

5 min read

Match text from different domains using language models 📑
Match text from different domains using language models 📑
Language Model

5 min read


Published in MantisNLP

·Aug 3, 2022

How to organize your grants using machine learning 🏷

Large grant giving organizations like the Wellcome Trust or Cancer Research UK receive thousands of applications for funding grants each year. Analysis of this data can be complicated by the fact that adding useful metadata to funding grant applications is often time consuming to do manually. By metadata, we mean…

Grants For Nonprofit

4 min read

How to organize your grants using machine learning 🏷
How to organize your grants using machine learning 🏷
Grants For Nonprofit

4 min read


Published in MantisNLP

·Jun 29, 2022

MLOps with SageMaker — Part II

Customize train 🐳 — In an earlier post we went through how to run a training script using sklearn, PyTorch or transformers with SageMaker by leveraging their preconfigured framework containers. The training scripts we used were self contained, meaning they only used the respective framework and python standard library. …

Mlops

6 min read

MLOps with SageMaker — Part II
MLOps with SageMaker — Part II
Mlops

6 min read


Published in MantisNLP

·May 4, 2022

MLOps with SageMaker — Part I

How to effortlessly train sklearn 📊, pytorch🔥, and transformers 🤗 models in the cloud — SageMaker is a Machine Learning Operations (MLOps) platform, offered by AWS, that provides a number of tools for developing machine learning models from no code solutions to completely custom. With SageMaker, you can label data, train your own models in the cloud using hyperparameter optimization, and then deploy those models…

Mlops

7 min read

MLOps with SageMaker — Part I
MLOps with SageMaker — Part I
Mlops

7 min read


Published in MantisNLP

·Apr 13, 2022

Making an optimisation algorithm 10k times faster 🏎

How we made our multilabel classification threshold optimizer converge in minutes instead of days — Multilabel classification is a common task in machine learning and Natural Language Processing (NLP). We approach it by training a model that can apply one or more labels to each new example that it sees. …

Machine Learning

7 min read

Making an optimisation algorithm 10k times faster 🏎
Making an optimisation algorithm 10k times faster 🏎
Machine Learning

7 min read


Published in Wellcome Data

·Dec 13, 2021

Tagging biomedical grants with 29K tags

In a previous post we spoke about a neural architecture we developed for classifying our grants with ~5K disease tags from the MeSH (Medical subject Headings) hierarchy. In this post we will touch on the techniques needed to scale to a model to classify all ~29K MeSH tags. Our dataset…

Naturallanguageprocessing

7 min read

Tagging biomedical grants with 29K tags
Tagging biomedical grants with 29K tags
Naturallanguageprocessing

7 min read


Published in MantisNLP

·Jun 20, 2021

Introducing Mantis

In this blog post we introduce Mantis NLP, a remote first company we have founded to help companies put impactful data science into production. …

Consultancy

2 min read

Consultancy

2 min read


Published in Wellcome Data

·Apr 23, 2021

Reproducible data science

Ever since we started working on data science projects at Wellcome data labs we have been thinking a lot about reproducibility. As a team of data scientists, we wanted to ensure that the results of our work can be recreated by any member of the team. Among other things this…

Reproducibility

7 min read

Reproducible data science
Reproducible data science
Reproducibility

7 min read

Nick Sorros

Nick Sorros

139 Followers

Founder of MantisNLP www.nicksorros.com

Following
  • Peak

    Peak

  • Matthew Upson

    Matthew Upson

  • Nir Eyal

    Nir Eyal

  • Arthur Juliani

    Arthur Juliani

  • Orfeas Kypris

    Orfeas Kypris

Help

Status

Writers

Blog

Careers

Privacy

Terms

About

Text to speech