top of page

Blogs & Projects

Blog Posts

Leverage Streamlit, Hugging Face, and AWS to create your very own text generation app

Prompt engineering and how it can be used with text generation models

Screenshot 2022-07-31 at 14.34.53.png

Use SageMaker Processing Jobs to easily augment your NLP Dataset with Hugging Face’s Transformer Models

Screenshot 2022-07-31 at 14.35.25.png

Projects

Ashoka Strategy Bot

Founded in 1980, Ashoka is the world's largest network of social entrepreneurs. Ashoka selects, supports and collaborates with 4000 social entrepreneurs (Ashoka Fellows) in over 90 countries, helping them scale their system changing solutions within every field, from education to health care, to environment, economic development and human rights. Ashoka was looking to create and implement an “Impact Strategy Bot” based on OpenAI’s GPT-3 model. The goal was to create an NLP model that generates an interesting strategy for addressing a social problem provided by a prompt from the user.

 

My task was to create a finetuned custom GPT-3 model for Ashoka to accomplish this goal. To achieve this I first analysed, cleaned, and prepared the more than 4,000 records that Ashoka provided as a potential training dataset. After diving deeper into the data I identified potential features that could be used for training GPT-3. I prepared the dataset accordingly and also engineered the appropriate prompts for the model to maximise the likelihood of the model creating interesting strategies. I then trained GPT-3 using OpenAI’s API. After a few iterations and testing the custom models together with Ashoka I was successful in creating an NLP model that would generate interesting strategies from a short prompt mentioning a social problem. The model will be implemented by Ashoka to be used by future social entrepreneurs within their platform.

Screenshot 2022-10-23 at 14.04_edited.jp

Contract review is the process of thoroughly reading a contract to understand the rights and obligations of an individual or company signing it and assessing the associated impact. It is widely viewed as one of the most repetitive and most tedious jobs that junior law firm associates must perform. It is also expensive and an inefficient use of a legal professional’s skills. In this project I show how to set up a machine learning models to automate contract reviews.

Screenshot 2022-10-23 at 14.04_edited.jp

As a volunteering data scientist at  DataKind UK I worked with Global Witness on a project to analyse the world’s first fully open register of the real owners of its companies. We were looking for mistakes and suspicious signs, while also comparing information in the register with other datasets. This provided us with an unprecedented overview of UK company ownership today and allowed us to identify loopholes, information gaps and suspicious activity.

Our analysis revealed that thousands of companies are filing highly suspicious entries or not complying with the rules – problems we never would have found were it not for the open data nature of the register.

Screenshot 2022-07-31 at 15.06.25.png

AI/ML Consulting LTD is registered with Companies House

aiml-consulting@posteo.net

71-75 Shelton Street, Covent Garden, London, United Kingdom, WC2H 9JQ

AI/ML Consulting Ltd provides consulting services for AI & Machine Learning. We help organisation analyse their business use cases and discover opportunities to unlock the power of AI & ML. We provide training to senior stakeholders and product teams to enable them to use AI within their teams. 

bottom of page