Sr. Data Scientist

The Data Science team at Penguin Random House is seeking a Senior Data Scientist.

We are an agile team of data scientists and data engineers. The team has a wide mandate encompassing sales forecasting, supply chain, recommendation / personalization systems, uncovering trending titles, title segmentation, pricing systems, as well as data exploration and research applying novel statistical methods.

In this role, you will work on a variety of high-profile projects in collaboration with key decision makers across the organization. You should have a strong background in statistical forecasting models and experience in building production-stable pipelines in Python. You should also be a creative thinker capable of proposing and implementing novel approaches that impact the business.


Apply if you have:

  • A bachelor’s degree in mathematics, statistics, economics, computer science, business analytics, or any quantitative social science
  • 4+ years of professional experience programming in Python in a data science or ML role
  • Expertise in applying advanced statistical models for forecasting/prediction
  • Expertise in writing and maintaining stable production level code in Python (e.g., for automating data pipeline/modeling tasks)
  • Aptitude with shell scripting, debugging tools, containerized environments, and any flavor of Linux
  • Experience working with relational databases and fluency in SQL
  • Ability to communicate complex technical concepts to a business audience


Distinguishing Experience:

  • A master’s degree or PhD in a mathematical science, statistics, quantitative social science or related field. Alternatively, two years of additional experience in a data science or ML role
  • Experience building data products from the warehouse ingestion phase all the way through to the business-facing application side
  • Experience developing, testing, and deploying web applications using Flask, Django, or another Python web framework
  • Experience working with cloud-based computing platforms (e.g. AWS, Google Cloud Platform)
  • Experience working with container orchestration for deploying, scaling, and managing production tasks, preferably using Kubernetes and Rancher


Penguin Random House is the leading adult and children’s publishing house in North America, the United Kingdom and many other regions around the world.  In publishing the best books in every genre and subject for all ages, we are committed to quality, excellence in execution, and innovation throughout the entire publishing process: editorial, design, marketing, publicity, sales, production, and distribution.  Our vibrant and diverse international community of nearly 250 publishing brands and imprints include Ballantine Bantam Dell, Berkley, Clarkson Potter, Crown, DK, Doubleday, Dutton, Grosset & Dunlap, Little Golden Books, Knopf, Modern Library, Pantheon, Penguin Books, Penguin Press, Penguin Random House Audio, Penguin Young Readers, Portfolio, Puffin, Putnam, Random House, Random House Children’s Books, Riverhead, Ten Speed Press, Viking, and Vintage, among others.  More information can be found at


Penguin Random House values the array of talents and perspectives that a diverse workforce brings. All qualified applicants will receive consideration for employment without regard to race, national origin, religion, age, color, sex, sexual orientation, gender identity, disability, or protected veteran status.



Company: Penguin Random House LLC 

Country: United States of America 

State/Region: New York 

City: New York 

Postal Code: 10019 

Job ID: 140321

Date:  Jun 12, 2021

New York, NY, US, 10019

Nearest Major Market: Manhattan
Nearest Secondary Market: New York City

Job Segment: Database, Scientific, Scientist, Warehouse, Technology, Engineering, Science, Manufacturing, Research