Surbhi Gupta,印度西孟加拉邦Jalpaiguri的开发商
Surbhi is available for hire
Hire Surbhi

Surbhi Gupta

Verified Expert  in Engineering

数据科学家和机器学习开发人员

Location
Jalpaiguri, West Bengal, India
Toptal Member Since
November 2, 2021

Surbhi, 他曾是GenAI初创公司的首席技术官和MUJ的助理教授, is a generative AI, ML, 5年以上NLP工作经验. 她为Toptal的初创公司和Utopia的财富500强客户设计并开发了基于ml的端到端解决方案. 她的专长包括ML、深度学习、NLP、计算机视觉、llm、GPT、AI、MLOps和AWS. Surbhi解决了EAM、营销、金融、聊天机器人和加密行业的问题. 她发表了关于机器人和优化的研究.

Portfolio

Freelance Client
亚马逊网络服务(AWS)、AWS Amplify、人工智能(AI)...
Alec Beglarian
人工智能,深度学习,OpenAI, ChatGPT, JavaScript, React...
SimpliCapital LLC
Python, Jupyter, Amazon Web Services (AWS), 机器学习操作(MLOps)...

Experience

Availability

Part-time

Preferred Environment

Python, TensorFlow, Scikit-learn, OpenCV, Hugging Face, OpenAI, PyTorch, Amazon Web Services (AWS), 生成预训练变压器(GPT)

The most amazing...

...我开发的生成式人工智能解决方案与用户交互,以确定他们的品牌目的,并生成BVP和带有文本和图像的营销内容.

Work Experience

Co-founder and CTO

2022 - 2023
Freelance Client
  • 为某公司在外汇局融资中获得大笔投资资金, effective investor engagement, 以及对这项技术的清晰解释.
  • 开发产品的第一个版本,满足所有关键功能需求.
  • 领导由知名人工智能公司和投资者评估的技术尽职调查过程.
  • 通过有效的面试和评估方法,率先建立了一支优秀的团队.
  • Utilized OpenAI models, 有效的即时工程策略, 以及在人工智能引导下与人类用户进行对话的几次学习. 从这些互动中提取有价值的见解,并产生有影响力的营销主张.
  • 设计一个反馈机制,从现场专家那里收集训练数据,对模型进行微调.
Technologies: 亚马逊网络服务(AWS)、AWS Amplify、人工智能(AI), OpenAI GPT-3 API, User Feedback, Few-shot Learning, Vue, TypeScript

GPT-3 Expert

2022 - 2023
Alec Beglarian
  • 使用OpenAI GPT-3 api开发email生成MVP.
  • 在集成数据库的AWS云平台上开发MVP, storage, lambda functions, Amazon SES, etc.
  • 微调OpenAI模型的数据校正,用于电子邮件生成.
Technologies: 人工智能,深度学习,OpenAI, ChatGPT, JavaScript, React, 生成预训练变压器3 (GPT-3)

ML Developer

2022 - 2022
SimpliCapital LLC
  • 使用AWS云平台部署机器学习模型, 使用lambda函数等服务, Amazon SageMaker, Amazon SNS, Amazon S3, etc.
  • 改进机器学习模型性能,用于预测金融数据.
  • 为ML模型创建了Amazon SageMaker训练和推理管道.
Technologies: Python, Jupyter, Amazon Web Services (AWS), 机器学习操作(MLOps), Machine Learning, Data Science, Node.js, Amazon SageMaker

AI Specialist | NLP Python Developer

2022 - 2022
Toptal Client
  • 改进了NLP解决方案,以识别财务数据中的业务前景.
  • 使用基于bert的词性标注从大型文档中提取重要特征.
  • 通过识别用于标记与业务前景相关的句子的单词,使解决方案可解释.
  • 采用拥抱脸变换模型进行语义相似度分析.
技术:生成预训练变压器(GPT), GPT, Natural Language Processing (NLP), GPU Computing, Python, Semantics, Transformers, PyTorch, Large Language Models (LLMs)

AI专家| Python和ML开发者

2022 - 2022
Daylight
  • Used OpenAI for solving Q&一个聊天机器人和文档查询问题.
  • 为OpenAI文档查询解决方案提供了一个更准确的开源替代方案.
  • 利用基于语义相似度的聚类方法对聊天聚类进行分组识别.
技术:生成预训练变压器(GPT), GPT, Natural Language Processing (NLP), OpenAI, Transformers, Clustering, Scikit-learn, Beautiful Soup, Web Scraping, Deep Learning, GPU Computing, 生成预训练变压器3 (GPT-3), Data Analysis, Sequence Models, PyTorch, Large Language Models (LLMs)

AI专家| Python和ML开发者

2021 - 2022
Freelance
  • 对社交媒体数据进行姿态检测和话题建模, 使用无监督和半监督方法.
  • 为自定义摘要任务微调预训练的seq2seq转换器模型.
  • 在NLP任务中使用BERTscore和ROUGE评分等NLP性能评估指标,得分为0.总结任务89分.
Technologies: Machine Learning, Python, Scikit-learn, PyTorch, Transformers, 生成预训练变压器(GPT), GPT, Natural Language Processing (NLP), Transfer Learning, Sequence Models, Data Science, Graphics Processing Unit (GPU), BERT, Named-entity Recognition (NER), Entity Extraction, Matplotlib, LSTM, Classification, TensorFlow, Pandas, Artificial Intelligence (AI), Deep Learning, Time Series Analysis, Time Series, Data Analysis, GPU Computing, Clustering, Cloud, Large Language Models (LLMs)

Senior Data Science Engineer

2017 - 2021
Utopia
  • 开发端到端机器学习解决方案,用于从扫描文档和图表中提取信息,为一家财富100强公司带来了一笔好交易. 将项目作为云应用程序部署到客户端.
  • 构建了一个从描述和标签中识别设备类的解决方案, 哪个用于向多个客户端交付服务.
  • 创建了一个解决方案,该解决方案能够从用于向各种客户交付服务的材料主数据中的产品描述中识别有效值.
  • 开发了一个解决方案来识别图表图像中的不同形状、表格和文本. 这需要几种计算机视觉的应用, image processing, deep learning, and machine learning techniques.
Technologies: Machine Learning, Deep Learning, Computer Vision, 生成预训练变压器(GPT), GPT, Natural Language Processing (NLP), Python, TensorFlow, Git, Streamlit, OCR, Scikit-learn, Image Processing, OpenCV, Convolutional Neural Networks, NumPy, Amazon S3 (AWS S3), AWS Lambda, Tesseract, You Only Look Once (YOLO), Object Detection, Transfer Learning, Text Detection, Keras, Amazon Web Services (AWS), PyTorch, Cloud Deployment, 机器学习操作(MLOps), Named-entity Recognition (NER), Entity Extraction, SQL, Matplotlib, Algorithms, LSTM, Classification, Image Recognition, Pandas, Artificial Intelligence (AI), Data Analysis, Team Leadership, GPU Computing, Clustering, Cloud, Machine Vision, Code Review, Source Code Review, Technical Hiring, Interviewing

Assistant Professor

2017 - 2017
Manipal University Jaipur
  • 讲授机器人、机电一体化系统设计等课程, 包括关于机器学习的主题, artificial intelligence, and sensors.
  • 进行实验室实验,让学生亲身体验MATLAB, control systems, and sensors.
  • 撰写学期论文和在线测验,并评估学生的表现.
Technologies: Robotics, Machine Learning, Mechatronics, University Teaching, Matplotlib, Algorithms, Scikit-learn, Code Review, Interviewing

Senior Research Fellow

2012 - 2016
中央科学仪器组织
  • 优化了微创手术机械臂的设计,制定了末端执行器轨迹跟踪的运动学控制. Published two papers on this work.
  • 改进了被动双足机器人的设计,并对其进行了欠驱动仿真,使其在0 ~ 30度陡坡上稳定行走. Published two papers on this work.
  • 给毕业班的学生教授工业控制和机器人等课程.
Technologies: Robotics, Python, Control Systems, Underactuation, Research, Matplotlib, Algorithms, Classification, Scikit-learn, OpenCV, Artificial Intelligence (AI), Computer Vision, Time Series Analysis, Time Series

欠驱动双足机器人动力学与控制研究综述

本文对各种设计进行了总结, models, 以及用于使欠驱动双足机器人稳定行走和运行的控制策略. 我是这本书的第一个从事文献调查和写作的作者.

文章可从以下链接获得:http://www.tandfonline.com/doi/full/10.1080/01691864.2017.1308270

Split Compound Words

http://github.com/droid-surbhi/split-compound-words
开发并发布了一个GitHub存储库,用于将文本行中的复合词拆分为英语词典中的单词集合. 如果提供了非英语文本,那么还可以选择首先翻译为英语.

Fake Vs. Real News Classification

http://www.kaggle.com/surbhig/classification-fake-vs-news-95-accuracy
我创建并发布了一个Kaggle笔记本,可以将给定的新闻段落分为假段落和假段落. real, with 95% accuracy. Kaggle dataset for fake vs. 真实的新闻被用于训练和测试. 我使用多项朴素贝叶斯进行分类,使用TF-IDF进行特征矢量化.

Optimization Using Meta-heuristics

http://github.com/droid-surbhi/Optimization
我开发并发布了一个GitHub存储库,用于使用禁忌搜索和人工蜂群优化(ABC)等流行的元启发式优化。. 禁忌搜索是在MATLAB中从头开始编写的, 而ABC是用Python编写的,是该算法原作者发布的代码的修改版本. 这些修改是为了使用ABC实现约束整数优化.

微创手术机器人的设计优化

http://doi.org/10.1016/j.asoc.2015.03.032
我们优化了三自由度连续机械臂的设计,使其作为微创机器人手术(MIRS)手臂操作,并获得多个相邻可能的孔位置, 通过它可以跟踪预定几何形状的平面工作空间. To achieve this goal, 我们开发了一种算法,将这种MIRS手臂的设计与可能的孔位置联系起来. 采用模拟退火等元启发式方法求解优化问题, Tabu search, artificial bee colonization, and genetic algorithm, and their performance was compared.

LighVe: Music Synced Lights

LighVe是一套由灯条和一个移动应用程序组成的设备. LighVe可以接收房间里正在播放的任何音乐,并将灯光的视觉效果与音乐的节奏同步. I worked on the hardware, circuit design, electronics, 和应用程序设计,另一个合作者开发后端.

关节式微创手术机械臂的运动控制

http://ieeexplore.ieee.org/document/7853054
我们基于作用于末端连杆的约束条件创建几何变换,并结合使用传统技术获得的运动学关系,用于驱动模拟的六自由度通用关节机械臂进行微创手术. 该仿真手臂通过跟踪预定义的平面和三维轨迹验证了该方法.

Clustering Utilities

http://github.com/droid-surbhi/clustering
该存储库包含基于最新研究论文的新聚类方法和实用程序的实现. For example, 增量聚合集群, given old clusters, maps new data to old clusters, 为未映射的记录创建新的集群.

Resume Classification

http://github.com/droid-surbhi/resume_classification
对候选人简历进行无监督分类, development, 和管理类别使用Python.
•它使用潜在的狄利克雷分配进行主题建模,并计数矢量器进行矢量化.
•它还使用文字云可视化组.

Algorithm Design

http://github.com/droid-surbhi/algorithm_design
该程序允许用户练习算法设计问题的解决方案和解释. It includes topics like recursion, sorting, trees, graph search, dynamic programming, 以及筛选过程中出现的问题.

Motion Planning

http://github.com/droid-surbhi/motion_planning/blob/main/simpleMotion.ipynb
这篇文章探讨了在机器人运动中有用的各种运动规划算法, 包括简单的运动技术,如快速探索随机树(RRT), 最短路径的Dijkstra算法, probabilistic roadmaps (PRM), and more.

Languages

Python, SQL, c++, JavaScript, TypeScript

Libraries/APIs

Pandas, PyTorch, LSTM, TensorFlow, Scikit-learn, OpenCV, Keras, Matplotlib, AWS Amplify, Vue, NumPy, React, Beautiful Soup, Node.js

Tools

Git, Named-entity Recognition (NER), MATLAB, You Only Look Once (YOLO), Amazon SageMaker, Confluence, OpenAI Gym, Jupyter

Paradigms

Data Science

Other

Robotics, Artificial Intelligence (AI), Machine Learning, Deep Learning, Computer Vision, Natural Language Processing (NLP), Streamlit, OCR, Neural Networks, Tesseract, Research, Entity Extraction, Classification, Image Recognition, Data Analysis, OpenAI, Machine Vision, Large Language Models (LLMs), GPT, 生成预训练变压器(GPT), Chatbots, Fine-tuning, University Teaching, Control Systems, Underactuation, Convolutional Neural Networks, Object Detection, Transfer Learning, Text Detection, 机器学习操作(MLOps), Graphics Processing Unit (GPU), Transformers, BERT, Sequence Models, Algorithms, Time Series Analysis, Time Series, 生成预训练变压器3 (GPT-3), GPU Computing, Team Leadership, Cloud, Code Review, Source Code Review, Technical Hiring, Interviewing, Minimum Viable Product (MVP), Image Processing, Mechatronics, Optimization, Metaheuristics, Robot Operating System (ROS), Gated Recurrent Unit (GRU), Containers, Technical Writing, Publication, Simulations, Mathematics, Clustering, Web Scraping, Semantics, 生成对抗网络(GANs), Hugging Face, Unsupervised Learning, Topic Modeling, Data Visualization, DALL-E, ChatGPT, OpenAI GPT-3 API, User Feedback, Few-shot Learning, Motion Planning

Platforms

AWS Lambda、Amazon Web Services (AWS)、Docker、AWS Elastic Beanstalk

Storage

Amazon S3 (AWS S3)、云部署、谷歌云

2010 - 2012

Master's Degree in Mechatronics

印度工程科学技术学院-加尔各答,印度

2005 - 2009

电子与通信工程专业本科以上学历

邦德尔坎德大学-印度Jhansi

MAY 2022 - MAY 2025

AWS Certified Cloud Practitioner

Amazon Web Services

FEBRUARY 2022 - PRESENT

Introduction to Containers

AWS

JANUARY 2022 - PRESENT

AWS Elastic Beanstalk简介

AWS

JANUARY 2022 - PRESENT

Sequence Models

DeepLearning.AI

APRIL 2021 - PRESENT

介绍用于人工智能、机器学习和深度学习的Tensorflow

DeepLearning.AI | via Coursera

AUGUST 2020 - PRESENT

Git Complete: Git的最终一步一步指南

Udemy

MAY 2019 - PRESENT

Neural Networks and Deep Learning

DeepLearning.AI | via Coursera

AUGUST 2015 - PRESENT

6.00.x:计算机科学与Python编程入门

MITx

MARCH 2014 - PRESENT

Control of Mobile Robots

Coursera

JULY 2013 - PRESENT

Machine Learning

Coursera