Skip to content

Dynamic Parameters for Mediatek Delegate #9035

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
shreshth-tru opened this issue Mar 7, 2025 · 1 comment
Open

Dynamic Parameters for Mediatek Delegate #9035

shreshth-tru opened this issue Mar 7, 2025 · 1 comment
Assignees
Labels
partner: mediatek Issues related to the Mediatek delegate triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module

Comments

@shreshth-tru
Copy link

shreshth-tru commented Mar 7, 2025

🚀 The feature, motivation and pitch

While working with the MediaTek delegate for android app I am trying to identify all the dynamic parameters that can possibly be set with the current implementation of the model generation (and what limitations are there for now). Currently are model parameters are hardcoded for the android llama app which can be made dynamic by passing the .h file as a converted .txt file and mapped to variables at the backend.

  • Objective:

    • Identify all the dynamic parameters that can be set within the current model generation implementation.
    • Understand the limitations regarding configurable variables.
  • Potential Configurable Parameters:

    • Context Length:
      • Can this be adjusted dynamically?
      • Expected impact on model performance and response time.
    • Prompt Token Batch Size:
      • Is this configurable?
      • Expected effect on memory usage and response latency.
    • Cache Size:
      • Can we change the cache size dynamically?
      • Expected effects on memory management and model inference speed.
    • Other Variables:
      • Are there other variables (e.g.,ROT_EMB_BASE , temperature, top_p) that can be configured dynamically?
      • What are the expected outcomes of changing these parameters?
  • Expected Outcomes:

    • Clarification of which parameters can be safely modified without impacting the model's functionality.
    • Improved flexibility for adjusting model behavior based on real-time requirements or user input.

This should help make the question clearer and provide a roadmap for addressing these dynamic configurations.

Alternatives

No response

Additional context

No response

RFC (Optional)

No response

cc @cccclai @neuropilot-captain @cbilgin

@iseeyuan iseeyuan added triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module partner: mediatek Issues related to the Mediatek delegate labels Mar 7, 2025
@iseeyuan
Copy link
Contributor

iseeyuan commented Mar 7, 2025

@cccclai Could you help on this, or ping MTK folks?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
partner: mediatek Issues related to the Mediatek delegate triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module
Projects
None yet
Development

No branches or pull requests

4 participants