My Solution for Design a Graph Search Function for a Social Network with Score: 8/10

by zion7046

System requirements


Functional:

  1. Users should be able to query with multiple filters.
  2. Filters should support AND and OR combinations between them.
  3. The user's personalized context should automatically be part of the query.
  4. Users should be able to link people, places, interests, and activities together.



Non-Functional:

  1. Queries should be fast: P90 latency under 200 ms.
  2. The system should scale to handle billions of queries per day and be partition tolerant.
  3. Strong consistency is not required; eventual consistency is acceptable.



Capacity estimation

  1. For a social network with billions of monthly active users, assume 100M daily active users making 5 queries each, i.e. about 500M queries per day. That averages to roughly 6k TPS; with a 2x peak factor, we should be ready for roughly 12k TPS.
  2. Say we allow attributes like "is a friend of", "likes place", and "has interest" to be searched on. Per user we store roughly 200 friend IDs, 10 places, 30 interests, and 10 activities, so about 250 unique attributes per user.
  3. Each unique attribute can be represented by a 16-byte UUID, so about 4 KB of attribute data per user. Assume 1B users, 1M unique places, 10k interests, and 10k activities.

≈ 1B users × 4 KB/user ≈ 4 TB of data (call it ~5 TB with index overhead).
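A quick script to double-check the arithmetic (constants taken from the assumptions above):

```python
# Back-of-envelope check of the capacity numbers above.
DAU = 100_000_000                 # daily active users
QUERIES_PER_USER = 5
SECONDS_PER_DAY = 86_400

queries_per_day = DAU * QUERIES_PER_USER            # 500M queries/day
avg_tps = queries_per_day / SECONDS_PER_DAY         # ~5.8k TPS sustained
peak_tps = 2 * avg_tps                              # ~11.6k TPS at peak

ATTRS_PER_USER = 200 + 10 + 30 + 10                 # friends + places + interests + activities
UUID_BYTES = 16
bytes_per_user = ATTRS_PER_USER * UUID_BYTES        # 4,000 B ≈ 4 KB
storage_tb = 1_000_000_000 * bytes_per_user / 1e12  # ~4 TB across 1B users
```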





API design

  1. indexEntities(Type, Schema); // people, places, etc. may each require a different schema, so this could be one API or several depending on the structure.
  2. List<OutputStructure> os = search("natural language input"); // a GraphQL layer can break the input down into a system-understandable query for the API below.
  3. Response<List<responseIds, queryRequestFields>, nextToken> x = search(Query<list of filters>, SortBy<list of columns, Asc/Desc>, pageSize, nextToken);
  4. Details objectByEntityType = batchGetDetails(responseIds);
  5. Top5Suggestions = intellisenseAPI("inputString"); // to help autocomplete the search.
  6. Filter -> column name, expected value (a single value or an element of a list attribute), operator (=, regex, >, <, ...).
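One way the Query, Filter, and Response shapes from points 3 and 6 might look as concrete types. All names and fields here are illustrative, not a committed schema:

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Any, List, Optional

class Op(Enum):
    EQ = "="
    GT = ">"
    LT = "<"
    REGEX = "regex"
    IN = "in"                 # value is an element of a list attribute

@dataclass
class Filter:
    column: str               # e.g. "friendList", "interestList"
    op: Op
    value: Any

@dataclass
class Query:
    filters: List[Filter]
    combinator: str = "AND"   # AND / OR across the filters

@dataclass
class SearchRequest:
    query: Query
    sort_by: List[tuple] = field(default_factory=list)  # [("column", "asc"), ...]
    page_size: int = 20
    next_token: Optional[str] = None

@dataclass
class SearchResponse:
    response_ids: List[str]   # entity IDs; hydrate via batchGetDetails
    next_token: Optional[str] = None

# Example: friends of user-123 who are interested in hiking
req = SearchRequest(
    query=Query(filters=[
        Filter("friendList", Op.IN, "user-123"),
        Filter("interestList", Op.IN, "hiking"),
    ]),
    sort_by=[("name", "asc")],
)
```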




Database design

  1. SQL could be considered, since joins can form the graph structure on the fly and multiple column indexes can speed up search. However, the data size is too large for SQL to scale comfortably, and maintaining many indexes inflates storage and write cost.
  2. NoSQL like DynamoDB scales well for situations like this. GSIs let us filter and sort on multiple fields; supporting arbitrary multi-filter queries is tricky, though, and can be addressed with composite keys/filters.
  3. Users table: UserId, InterestList, FriendList, ActivityList.
  4. Interests table: InterestId, location, details.
  5. Activities table: ActivityId, details, description, etc.
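Illustrative item shapes for the three tables, using the attribute names listed above (the example values are made up):

```python
# One item per table; the first attribute in each is the partition key.
user_item = {
    "userId": "user-123",
    "interestList": ["int-1", "int-9"],    # interest IDs
    "friendList": ["user-7", "user-42"],   # friend user IDs
    "activityList": ["act-3"],
}

interest_item = {
    "interestId": "int-1",
    "location": "Seattle",
    "details": "Hiking in the Cascades",
}

activity_item = {
    "activityId": "act-3",
    "details": "Weekend trail run",
    "description": "5k group run every Saturday",
}
```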




High-level design




  1. An Indexing Service listens to updates from sources like user data, places of interest, and activities, and stores them in their respective tables.
  2. A data stream captures all updates to these tables, joins them into a single row, and writes it into an Elasticsearch cluster.
  3. All IDs are indexed.
  4. Elasticsearch can handle AND and OR filters when we structure the schema and define the types of the indexed columns.
  5. Result IDs are returned and hydrated with details from the respective User, Activity, and Interest tables.
  6. All of these tables can be cached.
  7. Activities and interests can be cached based on geographical preferences.
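For point 4, AND filters map naturally onto the `filter` clause of an Elasticsearch bool query and OR filters onto its `should` clause. A minimal sketch of how the search service might build the request body; the field names are assumptions carried over from the table columns, and the fields are assumed to be keyword-typed:

```python
def build_es_query(and_filters, or_filters):
    """AND filters go into bool.filter; OR filters into bool.should
    with minimum_should_match=1 so at least one must hold."""
    bool_clause = {}
    if and_filters:
        bool_clause["filter"] = [
            {"term": {f["column"]: f["value"]}} for f in and_filters
        ]
    if or_filters:
        bool_clause["should"] = [
            {"term": {f["column"]: f["value"]}} for f in or_filters
        ]
        bool_clause["minimum_should_match"] = 1
    # Return IDs only; details are hydrated later from the entity tables.
    return {"query": {"bool": bool_clause}, "_source": ["userId"]}

body = build_es_query(
    and_filters=[{"column": "friendList", "value": "user-123"}],
    or_filters=[{"column": "interestList", "value": "hiking"},
                {"column": "interestList", "value": "cycling"}],
)
```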




Request flows

A query hits the load balancer and lands on a stateless search host. The host translates the filters into an Elasticsearch query, gets back the matching entity IDs, hydrates them via batchGetDetails against the User/Interest/Activity tables (cache first, then the tables), and returns the page of results with a nextToken.






Detailed component design


  1. A request hits the load balancer, which routes it to stateless hosts of the Indexing and Search services.
  2. The Indexing Service listens to updates from sources like user data, places of interest, and activities, and stores them in their respective tables.
  3. A data stream captures all updates to these tables, joins them into a single row, and writes it into the Elasticsearch cluster.
  4. All IDs are indexed.
  5. Elasticsearch can handle AND and OR filters when we structure the schema and define the types of the indexed columns.
  6. Result IDs are returned and hydrated with details from the respective User, Activity, and Interest tables.
  7. These tables can be cached with a service like Redis, using LRU eviction so the most frequently requested entries stay in cache.
  8. Activities and interests can be cached based on geographical preferences, with the same LRU strategy.
  9. To handle hot partitions, we can shard the data.
  10. Failed calls are retried with an exponential back-off strategy, so we don't pile load onto a dependency during its downtime.
  11. Since we now have two data stores, an auditor job can verify data consistency between the tables and the ES cluster.
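A minimal sketch of the exponential back-off from step 10, using full jitter to spread retries out in time; the attempt count and delay caps are illustrative defaults, not tuned values:

```python
import random
import time

def retry_with_backoff(call, max_attempts=5, base_delay=0.1, max_delay=5.0):
    """Retry `call` on exception, sleeping a random amount between 0 and
    min(max_delay, base_delay * 2^attempt) before each retry (full jitter)."""
    for attempt in range(max_attempts):
        try:
            return call()
        except Exception:
            if attempt == max_attempts - 1:
                raise  # out of attempts; surface the failure
            delay = min(max_delay, base_delay * (2 ** attempt))
            time.sleep(random.uniform(0, delay))
```

A transient failure simply costs a few jittered sleeps; a persistent one is re-raised after the last attempt so the caller can fail the request.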






Trade offs/Tech choices

  1. NoSQL for the first layer of entity data plus ES for the joined, searchable view puts each store where it provides the most value.
  2. A GraphQL layer in front of the API structure allows natural-language input from the user.





Failure scenarios/bottlenecks

  1. Internal failures are handled with exponential back-off retries.
  2. ES writes could become a bottleneck if the data is updated too frequently; writes can be batched if more eventual consistency (staleness) is acceptable.
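A sketch of how index updates might be buffered before being written to ES in batches; the flush threshold and the sink (which would wrap the ES bulk API in practice) are assumptions:

```python
class BatchedWriter:
    """Buffer documents and flush them to a sink in batches,
    trading write latency for throughput."""

    def __init__(self, flush_size=500, sink=None):
        self.flush_size = flush_size
        self.sink = sink or (lambda batch: None)  # e.g. an ES bulk call
        self.buffer = []
        self.flushed = 0

    def write(self, doc):
        self.buffer.append(doc)
        if len(self.buffer) >= self.flush_size:
            self.flush()

    def flush(self):
        if self.buffer:
            self.sink(self.buffer)
            self.flushed += len(self.buffer)
            self.buffer = []
```

The larger the flush size (or the longer a time-based flush interval), the staler the index can be, which is exactly the consistency trade-off noted above.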






Future improvements

  1. Shard data at the geographical level.
  2. Replicate data to other regions for disaster recovery.
  3. Introduce personalization to make search more relevant based on the user's likes, posts, etc. A GenAI-based solution could meaningfully improve relevancy.