1 month ago

Data Engineer, Amazon AGI, AGI Content Management and Protection

US, MA, Boston
AI is the most transformational technology of our time, capable of tackling some of humanity’s most challenging problems. Amazon is investing in generative AI and the responsible development and deployment of large language models (LLMs) across all of our businesses. Come build the future of human-technology interaction with us.
We are looking for candidates who don’t just think outside the box, but make the box they are in bigger. The future is now. Do you want to be a part of it? Then read on!

We’re looking for a Data Engineer on Amazon’s AGI team to build world-class data platforms and deploy scalable data ingestion tools, with a commitment to fostering the safe, responsible, and effective development of AI technologies. The ideal candidate is an expert in petabyte-scale data ingestion, data processing, data modeling, ingestion systems design, and business intelligence tools, and passionately partners with the business to identify strategic opportunities where improvements in data infrastructure create outsized business impact. They are a self-starter who is comfortable with ambiguity, able to think big while paying careful attention to detail, and enjoys working in a fast-paced team. The ideal candidate needs exceptional technical expertise with large-scale lakehouses, distributed computing at a scale of thousands of hosts on multiple clusters, Spark, BI systems, and AWS services.


Key job responsibilities

· Design, implement, and support a platform providing ad hoc access to large datasets
· Interface with other technology teams to extract, transform, and load data from a wide variety of data sources using Spark or other state-of-the-art systems
· Implement data structures using best practices for lakehouses
· Model data and metadata for ad hoc and pre-built reporting, targeting read-, write-, and summary-optimized storage
· Interface with business customers, gathering requirements and delivering complete reporting solutions
· Build robust and scalable data integration (ETL) pipelines using Kotlin, Python, TypeScript, and Spark
· Build and deliver high-quality datasets to support ML training needs
· Continually improve, automate, and simplify self-service data ingestion at scale for customers
· Participate in strategic & tactical planning discussions
