First, welcome to the community! Second, great questions!
- Yes, we are updating the model week by week, so expect further updates that increase the context length.
- In general, we update pricing regularly to stay competitive in the market. For example, a few weeks back we cut the prices on our Qwen models. DeepSeek V3-0324 is the first model we have released in preview mode, so the team is still evaluating it and will provide more updates when it's ready for production.
- Architecturally, yes, and we will share updates once we make it available.
Would love to learn a bit more about your use case with AI and what you are looking to build. Would be really curious to learn how you are planning to take advantage of prompt caching.