Jump to content

Tech learning roadmaps…


dasari4kntr

Recommended Posts

tell me this...spring based micro services lo DB design

3 services vunnavi...connected to their own schems...1 to 1 lekkana

 

service 1(uses schema 1) lo user table vundhi ... id (identity column, PK) name ,email, unique account id.

Service 2 lo and service 3 lo account id avasaram...ippudu account id ye ikkada use cheyatam better aaa in (schema 2 and schema 3) or id(from schema 1) column value better. db design appudu

 

Link to comment
Share on other sites

3 hours ago, Mr Mirchi said:

tell me this...spring based micro services lo DB design

3 services vunnavi...connected to their own schems...1 to 1 lekkana

 

service 1(uses schema 1) lo user table vundhi ... id (identity column, PK) name ,email, unique account id.

Service 2 lo and service 3 lo account id avasaram...ippudu account id ye ikkada use cheyatam better aaa in (schema 2 and schema 3) or id(from schema 1) column value better. db design appudu

 

 

account id reference is better...

keep id as identity and auto increments only...

 

Link to comment
Share on other sites

55 minutes ago, dasari4kntr said:

 

account id reference is better...

keep id as identity and auto increments only...

 

why account id ref in another db.schema..why not id..kind of pk/fk la wok avudhi kadhaa...if acct id needs to be updated for any reason..we can do only in user table kadhaaaa

Link to comment
Share on other sites

10 hours ago, Mr Mirchi said:

why account id ref in another db.schema..why not id..kind of pk/fk la wok avudhi kadhaa...if acct id needs to be updated for any reason..we can do only in user table kadhaaaa

generally concurrency issues vastayi ani avoid chestaru…

another thing is design…auto increments are numbers…during maintenance or backup restores… if seed reset happens…your system doesn’t work…

Link to comment
Share on other sites

  • 2 weeks later...

SPACE TITLE:

 

HOW TO GET YOUR FIRST JOB IN TECH AS A DEVELOPER... 

i didnt listen full...hope it might give good ideas about interviews and process... 

 

Link to comment
Share on other sites

 

F0W2AAoaMAEtrkl?format=jpg&name=large

What is a 𝗩𝗲𝗰𝘁𝗼𝗿 𝗗𝗮𝘁𝗮𝗯𝗮𝘀𝗲?

With the rise of Foundational Models, Vector Databases skyrocketed in popularity. The truth is that a Vector Database is also useful outside of a Large Language Model context.

When it comes to Machine Learning, we often deal with Vector Embeddings. Vector Databases were created to perform specifically well when working with them:

Storing.

Updating.

Retrieving.

When we talk about retrieval, we refer to retrieving set of vectors that are most similar to a query in a form of a vector that is embedded in the same Latent space. This retrieval procedure is called Approximate Nearest Neighbour (ANN) search.

A query here could be in a form of an object like an image for which we would like to find similar images. Or it could be a question for which we want to retrieve relevant context that could later be transformed into an answer via a LLM.

Let’s look into how one would interact with a Vector Database:

𝗪𝗿𝗶𝘁𝗶𝗻𝗴/𝗨𝗽𝗱𝗮𝘁𝗶𝗻𝗴 𝗗𝗮𝘁𝗮.

1. Choose a ML model to be used to generate Vector Embeddings.

2. Embed any type of information: text, images, audio, tabular. Choice of ML model used for embedding will depend on the type of data.

3. Get a Vector representation of your data by running it through the Embedding Model.

4. Store additional metadata together with the Vector Embedding. This data would later be used to pre-filter or post-filter ANN search results.

5. Vector DB indexes Vector Embedding and metadata separately. There are multiple methods that can be used for creating vector indexes, some of them: Random Projection, Product Quantization, Locality-sensitive Hashing.

6. Vector data is stored together with indexes for Vector Embeddings and metadata connected to the Embedded objects. 𝗥𝗲𝗮𝗱𝗶𝗻𝗴 𝗗𝗮𝘁𝗮.

7. A query to be executed against a Vector Database will usually consist of two parts: Data that will be used for ANN search. e.g. an image for which you want to find similar ones. Metadata query to exclude Vectors that hold specific qualities known beforehand. E.g. given that you are looking for similar images of apartments - exclude apartments in a specific location.

8. You execute Metadata Query against the metadata index. It could be done before or after the ANN search procedure.

9. You embed the data into the Latent space with the same model that was used for writing the data to the Vector DB.

10. ANN search procedure is applied and a set of Vector embeddings are retrieved. Popular similarity measures for ANN search include: Cosine Similarity, Euclidean Distance, Dot Product.

Some popular Vector Databases: Pinecone, Weviate, Milvus, Vespa.

-------- Follow me to upskill in #MLOps, #MachineLearning, #DataEngineering, #DataScience and overall #Data space.

Link to comment
Share on other sites

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.

Guest
Reply to this topic...

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

×
×
  • Create New...