Data Engineer Jumbotail

Download as pdf or txt
Download as pdf or txt
You are on page 1of 10

Data Engineer

Team: Engineering Location: Bengaluru

Who we are
Jumbotail is India's leading B2B marketplace and New Retail platform, serving lakhs of
mom & pop stores (“Kiranas”). Jumbotail helps retailers by providing them doorstep
delivery for all their food and grocery needs with its own supply chain and logistics
delivery network and also helps them to grow their business by offering credit.

Jumbotail also transforms traditional unorganized Kirana stores into J24 branded modern
convenience grocery stores, by providing a complete business toolkit and playbook
consisting of J24 consumer branding, in-store GoldenEye retail technology and
processes, real-time data-driven product selection and merchandising insights,
integrated supply chain with daily store servicing, Green Card loyalty program and
payment solutions which create a differentiated in-store experience for consumers. J24
also provides a complete omnichannel experience by taking the store online.

Jumbotail ecosystem has 4 in-house, proprietary platforms - B2B online marketplace,


Supply Chain & Logistics, Fintech for SME Lending, and Retail platform that powers its
J24 branded convenience grocery stores.

Jumbotail was founded by Karthik & Ashish, classmates from the Stanford MBA class of
2011. The company has raised USD $143M so far from Invus affiliate, Nexus Ventures,
Heron Rock, VII Ventures, Nutresa, Veronorte, Kalaari Capital, and several
well-renowned family offices and HNIs across the world. In Dec 2021 Jumbotail
completed its $85M Series-C funding round.

We have a strong and passionate team that is motivated to solve real-world problems
and impact lives. Our team consists of alumni from Stanford, MIT, IITs, IIMs, BITS, ISB, and
top NITs, having several years of industry experience in leading companies like Flipkart,
Amazon, Walmart, Ola, Oyo, and more. Together, we are reimagining and reengineering
the food and grocery ecosystem in India through intelligent technology, innovative
business models, and intuitive design. We are serving thousands of happy and satisfied
customers who trust Jumbotail for their food and grocery needs every day.

Jumbotail is at an inflection point - Covid-19 has brought Kiranas (its core customer
segment) the attention that their >95% share of India’s food and grocery market rightfully
merits. In parallel, it has exposed the frailties of traditional wholesale and FMCG
distribution as well as those of large-format modern trade. Retailers and brands alike, are
seeking a reliable, resilient, 21st-century alternative. Jumbotail is well-positioned to
capture this market opportunity and become an indispensable part of India’s $600 billion
food and grocery retail market.

While our market potential is huge - in the order of hundreds of billions of dollars, the
opportunity requires us to solve challenging problems that are so unique to India that
copycat models will not work. We are putting our customers first and building
technologies, products, platforms and services, and cutting edge supply chains that will
work for the next billion Indians who have fundamentally different needs - access,
language, selection, supply chain, financial and cultural.

The work you will do at Jumbotail will impact real lives and bring a lasting positive
change for the next billion people of India. We promise a fun, fast-paced ride with some
of the smartest people, with opportunities to learn and grow, and leave a legacy.
What are we looking for

We hire the best engineering minds and nurture them to solve tough engineering
problems in the retail ecosystem. We are solving complex problems in B2C, B2B retail,
Fintech and data platforms and we are always on the lookout for great talent who are
intrinsically motivated by the desire to solve hard problems.

As a data engineer, you will be responsible for ideating, designing, developing and
maintaining all the engineering building blocks which are required to keep the data
consistent, reliable and consumable.

We expect our data engineers to


- Architect the Data lake and Data warehouse infrastructure to enable the users to
execute business intelligence, data analytics, data science and ML/AI workloads
- Design, develop and maintain data pipelines for optimal extraction, transformation,
and loading of data from a wide variety of data sources.
- Build workflow orchestration tools to enable users to run periodic jobs on our data
infrastructure
- Build frameworks for data processing jobs to run decision science workloads such
as personalisation, fraud detection, optimisation algorithms and more
- Building observability over the data pipelines to maintain consistent data with an
agreed upon latency
- Ideate processes to maintain the health of the infrastructure such as cpu, memory
monitoring, query monitoring, workload monitoring and educate the teams to
follow the best practices. Set the standards high.
- Work closely with decision scientists, data analysts and product managers to
understand their data needs, their technical issues and help them in performance
tuning.
- Keep the data secure from the external world and have fine grained access
control within the organization to make sure that only relevant data is available to
every user.
- Create or integrate with tools for analytics and data scientist team members that
assist them in building and optimizing our product into an innovative industry
leader.

To cater to the above responsibilities, we need our data engineers to be equipped with
- Proven experience in Data engineering role with at least 1-5 years of experience
- High sense of ownership and strong decision making skills backed by first
principles thinking
- Strong analytical skills to break down the complex problems and build the right
solutions
- Good knowledge on data engineering concepts such as ETL pipelines, building
data lake, building data warehouses/ data marts, Stream and event processing,
distributed processing of large scale data.
- Experience building and optimising big data' data pipelines, architectures and
data sets.
- In depth knowledge in data warehousing concepts and familiarity with at least one
of the warehousing tools such as Amazon Redshift, Snowflake and more.
- Strong SQL knowledge and understanding of performance tuning techniques
- Good knowledge of big data tools such as Hadoop, Apache Spark, Apache Druid,
S3, Glue, Athena, Flink, Airflow, etc.
- Good exposure on streaming technologies like Kafka, SQS, Kinesis
- Good understanding of SQL and NoSql databases
- Experience with Amazon web services or Google Cloud Platform.
- Proficient in Java, Scala or Python.
- Experience in machine learning workloads is a plus
- Experience in supporting and working with product managers, decision scientists
and data analysts in a dynamic environment
Why is this role important to us?

Jumbotail has always been a technology-first organization. Over the years, we have built
many products to help our retailers, our sellers, our supply chain, and logistics heroes,
our credit partners, and a lot of internal teams.

Till January 2021, we were completely focused on helping retailers in Bangalore and
strengthening our business model. Over the last year, we started scaling across the
geographies, and currently, we are present in tens of cities across India. Our systems
had to scale 7X to support this and we had to build a lot of new products to cater to
increased scale.

With the recent round of $85m, we are set to scale across India and we need to scale
our systems by building next-generation technology to cater to the hyper-scale. We
need to invest heavily in our Search systems, Recommendation engines, Offer & Merch
systems, Order processing systems, Seller Platforms, Pricing Engines, Supply chain &
logistics systems, financial systems, new J24 retail platforms, and Credit systems to make
sure we are providing the best experience to our customers. Currently we are serving
lakhs of retailers across the nation and delivering millions of shipments every month
and we need to scale them atleast to 10X further.

Jumbotail as an organization is committed to data democratization and all the product,


business decisions happen through drawing insights from data. We have tens of
applications, hundreds of microservices that are generating data every second. Thus,
having consistent, reliable data which is stored and made available in a way that it can be
analyzed (real-time and historical) from various dimensions is a must for us.

As we are rapidly scaling across India to hundreds of cities, the volume of data we are
collecting, processing, and analyzing is growing exponentially. We need to be prepared
for the scale, future proof our systems and build the essential building blocks to help our
decision science & data analytics teams and product teams derive meaningful insights
and make impactful decisions.
What are the tools we use

We believe in choosing the right tool for the right job. We do not shy away from trying out
new technologies and learning from our mistakes.
Is Jumbotail the right place for me?

If you are an engineer who


- love solving real-world problems using technology
- can apply first principles thinking to solve problems
- can envision a great future that you want to create
- have the fire in your belly to get out of your cube and do something about your
vision and passion
- wants to work with some really smart people, and still raise the bar for all of us
- can have fun and help your colleagues have fun doing all of the above

Come join us in this scaling journey of Jumbotail. Your growth is in your hands, but
providing the opportunities and helping you grow is our promise.

Frequently Asked questions

1. Where is the office located?


a. We are located in Bangalore. Our office is located in Koramangala, near
Forum mall
2. Are you working from home?
a. Considering the current Covid situation, we are working from home. We are
yet to make a decision on future work policies.
3. What is the team culture like?
a. We live and breathe by our core values.
b. If you would talk to any of our existing or past employees, according to us,
this is what you might hear about us.
i. Extreme Ownership, Curiosity, being empathetic to each other,
sharing knowledge, humility, hunger to do better each day would
personify our team the closest.
c. Our team culture goes much more beyond the Friday evening fun sessions,
knowledge sharing sessions across the team, open feedback without
having any barriers of positions and the best way to understand that would
be by talking to any of the present or past team members.
4. What is your tech stack?
a. Please refer to What are the tools we use section
5. What are the challenging problems you are solving?
a. As we are scaling across India, we are required to solve a wide range of
new problems and scale the current system to at least 20x. Here are a few
examples, which you will have to solve in the next 6 months.
i. We generate millions of events everyday from our applications. How
do you design data architecture to enable users to slice and dice
data across varied dimensions with low latencies and eventual
consistency guarantees?
ii. We have a wide variety of data sources, we need to build and scale
the pipelines to a data lake/ data warehouse for historic analysis,
business reporting and for AI/ML workloads. How do you build and
scale the architecture? Throughput of this pipeline would need to at
least 50 million events per day at current scale and we need to scale
it to at least 10x without compromising on latency
iii. How do you architect reconciliation jobs to guarantee the
consistency of data in secondary sources of truth?
iv. A lot more complex problems which need to be solved by applying
first principles thinking are awaiting you.
6. What is the interview process?
a. Typically we conduct 2-3 interviews and one assignment round
7. How to prepare for the interviews?
a. Before interview
i. Please brush up your data engineering basics
ii. Our focus is more on fundamentals and practical application of
knowledge. Thus please try to practice as many hands on problems
as possible.
iii. Try to practice by keeping best practices in mind.
b. During interview
i. Most of our interviews are open google interviews. I.e. you can
search online for any new knowledge with the permission of the
interviewer. Thus do not worry about syntax.
ii. It is never about the solution but it is about the approach. Thus
please communicate more and explain your point of view and seek
help wherever required. Our effort is to have a similar discussion to
having a discussion with a colleague in a regular work environment.
8. I have feedback about an Interview. What should I do?
9. I interviewed with Jumbotail but did not receive an update. What should I do?
a. We try to provide the best candidate experience possible but in case if you
face any problems, please reach out to engineering@jumbotail.com
We look forward to having wonderful discussions with you. All the best for the interviews.

Jai Jawan, Jai Kisaan, Jai Dukaan !

You might also like