Data Engineer - AI & Machine Learning

TATA Consultancy Services Ltd.

San Francisco, CA

Job posting number: #7307006 (Ref:tsc-359351)

Posted: April 24, 2025

Job Description

Role: Data Engineer - Artificial Intelligence & Machine Learning

Location Options: Bay Area - CA




Responsibilities: -




1. Develop AI/ML Models:

•Design, build, and train machine learning models using appropriate algorithms (e.g., supervised, unsupervised, reinforcement learning, deep learning).

•Use various machine learning and AI frameworks (TensorFlow, PyTorch, Scikit-learn, Keras, etc.) to implement models.

•Experiment with different approaches (e.g., decision trees, neural networks, ensemble methods) and optimize for the best performance.

•Perform model selection, training, tuning, and validation using real-world data to achieve the most accurate results.

2. Data Preparation & Feature Engineering:

•Clean, preprocess, and structure raw data for analysis, ensuring it is suitable for model training.

•Implement data augmentation techniques, handle missing data, and remove outliers.

•Engineer features that will improve model performance, understanding the data's underlying relationships.

3. Algorithm Design and Optimization:

•Develop and optimize algorithms for specific use cases like image recognition, natural language processing (NLP), speech recognition, or recommendation systems.

•Optimize algorithms for high efficiency, scalability, and real-time performance.

•Regularly assess and improve model accuracy by experimenting with various hyperparameters, architectures, and optimization techniques.

4. Deploy Machine Learning Models:

•Collaborate with software engineers to deploy machine learning models into production environments, integrating them with existing systems.

•Ensure the models are scalable, performant, and able to handle real-time or batch data as required.

•Implement model monitoring and performance tracking tools to evaluate accuracy and detect any model drift over time.

5. Model Evaluation & Testing:

•Use cross-validation and other techniques to evaluate the model's generalization capabilities.

•Implement performance metrics to measure model accuracy, precision, recall, F1-score, and other relevant metrics based on project needs.

•Perform A/B testing and compare the performance of multiple models.

6. Continuous Improvement & Research:

•Stay up-to-date with the latest AI/ML research and advancements in the field, such as new algorithms, architectures, and technologies.

•Participate in code reviews and contribute to best practices in AI/ML development.

•Experiment with new AI and machine learning techniques to continually improve performance and solve complex problems.

7. Collaboration & Communication:

•Work closely with Data Scientists, Software Engineers, Product Managers, and other stakeholders to understand business problems and translate them into machine learning tasks.

•Communicate findings, insights, and progress to non-technical stakeholders in a clear, understandable manner.

•Collaborate on projects, providing expertise on AI/ML concepts to help shape product features or solutions.

8. Ethical Considerations and Bias Mitigation:

•Ensure that the models and algorithms are free from biases and ethically sound, particularly when dealing with sensitive data.

•Evaluate fairness, transparency, and interpretability of models, especially in critical applications like healthcare, finance, and legal sectors.

9. Documentation:

•Document the model-building process, algorithm choice, and data used, ensuring reproducibility and transparency.

•Write clear technical documentation and user guides to facilitate collaboration and knowledge transfer.

10. Innovation and Prototyping:

•Prototype AI-driven solutions to demonstrate their potential and feasibility.

•Develop proof of concepts (PoCs) and new algorithms for emerging AI and ML technologies (e.g., federated learning, reinforcement learning, generative models)




Qualifications:

1.Educational Background:

Bachelor's or Master’s degree in Computer Science, Data Science, Artificial Intelligence, Mathematics, or a related field.

Ph.D. in a relevant field is a plus but not required

2.Technical Skills:

•Strong programming skills in Python, R, or similar languages for machine learning and data analysis.

•Deep knowledge of machine learning libraries such as TensorFlow, PyTorch, Scikit-learn, Keras, and XGBoost.

•Strong foundation in linear algebra, probability, statistics, and optimization techniques.

•Proficiency in algorithms and data structures.

• Experience with deep learning techniques, including convolutional neural networks (CNNs), recurrent neural networks (RNNs), transformers, and reinforcement learning.

•Familiarity with NLP techniques, computer vision, time series analysis, and other AI sub-domains.

•Knowledge of data preprocessing, feature extraction, and feature selection techniques.

•Proficiency in cloud platforms (AWS, Azure, Google Cloud) for model training and deployment.

•Experience with containerization and orchestration tools (e.g., Docker, Kubernetes) is a plus.

•Familiarity with big data technologies (Hadoop, Spark, etc.) is a plus.

3.Soft Skills:

•Strong problem-solving and analytical skills.

•Ability to work in a collaborative, cross-functional environment.

•Effective communication skills to explain complex AI/ML concepts to non-technical stakeholders.

•Strong attention to detail and ability to troubleshoot and debug code.

•Passion for continuous learning and staying up-to-date with AI and ML advancements.

4.Experience:

•Proven experience (3+ years) in developing and deploying machine learning or AI models in a production environment.

•Familiarity with MLOps (machine learning operations) principles, such as model versioning, CI/CD pipelines for ML, and model monitoring in production

5.Preferred Qualifications:

•Experience with reinforcement learning, unsupervised learning, or generative models (e.g., GANs).

•Knowledge of ethics in AI, such as mitigating bias and ensuring fairness in models.

Familiarity with NLP libraries like SpaCy, NLTK, Hugging Face Transformers, etc.

•Experience in building and deploying AI-powered products in a commercial setting.

•Knowledge of edge computing and deploying AI models on edge devices

6.Work Environment:

•Collaborative and fast-paced work environment.

•Opportunity to work with state-of-the-art technologies.

•Supportive and dynamic team culture

•The position may require collaborating across multiple teams, including product, engineering, and research groups, to develop and implement AI/ML solutions




S alary Range: $149,500 - $224,250




#LI-AD1




Apply Now

Please mention to the employer that you saw this ad on AmericanCareers.com

More Info

Job posting number:#7307006 (Ref:tsc-359351)
Application Deadline:Open Until Filled
Employer Location:TATA Consultancy Services Ltd.
,
More jobs from this employer