Hey everyone, Kai here, back on clawdev.net after what felt like a whirlwind two weeks wrestling with a particularly stubborn LLM fine-tuning project. My brain’s still a little fried, but it got me thinking. We talk a lot about the big, flashy AI models, the impressive breakthroughs, the “what’s next” in the AI world. But what about the nuts and bolts? What about the sheer, unglamorous, yet utterly essential act of contributing to open-source AI projects?
Specifically, today I want to explore something I’ve been seeing a lot of chatter about – and frankly, experiencing myself – the evolving space of contributing to established, often complex, open-source AI libraries. It’s not just about submitting a bug fix anymore. With the rapid pace of AI development, maintaining these projects is a beast, and getting your contributions accepted requires a bit more finesse than it used to. Let’s talk about how to make your contributions actually stick.
The Evolving Gatekeepers: Why Getting Your PR Merged is Harder Than Ever
Remember the good old days? You’d spot a typo in a docstring, fix it, send a PR, and boom, instant contributor status. While those opportunities still exist (and are always appreciated!), for anything more substantial in a popular AI library, the bar has definitely risen. Why?
1. Project Maturity and Stability Demands
As AI libraries like Hugging Face Transformers, PyTorch, or even smaller, specialized frameworks mature, their core maintainers become increasingly focused on stability and backward compatibility. A seemingly small feature addition might introduce unforeseen regressions or break existing workflows for thousands of users. This isn’t gatekeeping for the sake of it; it’s a necessary evil to keep the ecosystem from collapsing.
I learned this the hard way last year trying to add a very specific, niche preprocessing step to a popular audio library. I thought it was a brilliant optimization. The maintainers, however, pointed out (very politely, thankfully) that it introduced an additional dependency that wasn’t strictly necessary for the core functionality and could complicate future updates. My PR got closed. It stung, but they were right.
2. The AI-Specific Complexity Tax
AI code, especially deep learning models, often involves intricate mathematical operations, specific hardware considerations (GPUs, TPUs), and a delicate balance of performance and accuracy. A simple change in an optimizer, for instance, might have profound effects on training stability or convergence. Testing these changes thoroughly isn’t trivial. It often requires specific datasets, compute resources, and a deep understanding of the model’s inner workings.
Just last month, I was debugging a weird NaN loss issue in a custom diffusion model implementation. Turns out, a small change in how a tensor was initialized (from `torch.zeros` to `torch.empty` for a slight speedup) was causing issues on certain GPU architectures due to uninitialized memory. It was a subtle bug, and it highlights how even seemingly minor code tweaks in AI can have disproportionately large impacts.
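To make that concrete, here's a minimal sketch of the allocation semantics involved (not the actual diffusion-model code, just the core difference between the two calls):

```python
import torch

# torch.empty allocates memory WITHOUT initializing it: the tensor holds
# whatever bytes happened to be in that memory, which can include huge
# values or even NaN/Inf bit patterns depending on the device and driver.
scratch = torch.empty(4, 4)

# torch.zeros guarantees a defined starting value, at a small extra cost.
safe = torch.zeros(4, 4)

# If a tensor is meant to act as an accumulator starting at zero, swapping
# in torch.empty silently breaks that assumption: everything accumulated
# into it inherits the garbage. With zeros, the invariant holds.
acc = torch.zeros(4, 4)
acc += torch.randn(4, 4)
assert torch.isfinite(acc).all()
```

The "speedup" from `torch.empty` is real only when every element is guaranteed to be overwritten before it's read; anywhere that guarantee is fuzzy, `torch.zeros` is the safer default.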
3. Maintainer Burnout and Resource Constraints
Let’s be real: maintainers of these massive projects are often doing it in their “free time” or as part of their day job, which already involves a million other things. They’re flooded with issues, feature requests, and pull requests. If your PR isn’t well-explained, doesn’t adhere to conventions, or requires extensive back-and-forth, it’s more likely to get deprioritized or even overlooked.
I’ve been on both sides of this. As a maintainer of a small utility library, I’ve seen PRs that were essentially just a raw code dump with no explanation. It’s frustrating because it means I have to spend my limited time figuring out what the contributor *intended* to do, rather than reviewing their *solution*.
How to Make Your AI Open-Source Contributions Shine (and Stick)
So, given these challenges, how do we, as eager contributors, ensure our efforts aren’t in vain? It’s about being strategic, thoughtful, and frankly, a little empathetic to the maintainers’ plight.
1. Do Your Homework: Read the Docs, Issues, and Existing PRs
Before you even think about writing a single line of code, spend an hour (or three) immersing yourself in the project. Read the contribution guidelines. Seriously, read them. Look at existing open and closed issues. Is someone else already working on this? Has this feature been rejected before and why? Are there any design discussions around similar topics?
This avoids “reinventing the wheel” or proposing something that goes against the project’s long-term vision. It also shows maintainers you’ve put in the effort, which immediately earns you goodwill.
2. Start Small, Build Trust
Don’t jump straight into proposing a massive new feature that re-architects half the library. Start with smaller, more manageable contributions. This could be:
- Improving documentation (fixing typos, clarifying ambiguous sections, adding examples).
- Refactoring existing code for readability or minor performance gains (but only if explicitly encouraged by maintainers).
- Submitting a well-isolated bug fix for a clearly defined issue.
- Adding a new, simple test case for an existing feature.
Each successful, small contribution builds your reputation within the community. Maintainers get to know your coding style, your attention to detail, and your ability to follow instructions. This makes them more likely to trust you with larger contributions down the line.
My first accepted PR to a popular ML framework was literally just adding a missing argument to a function’s docstring example. It took me 10 minutes, but it got my name on the contributor list and gave me a confidence boost.
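For a sense of scale, that kind of fix looks something like this (the function and argument names here are made up for illustration, not the actual library's API):

```python
def resample_audio(waveform, orig_sr, target_sr, dtype="float32"):
    """Resample a waveform to a new sample rate.

    Example:
        >>> out = resample_audio(wave, orig_sr=44100, target_sr=16000)

    Before the fix, the example above omitted the required ``target_sr``
    argument, so anyone copy-pasting it hit a TypeError. The whole PR was
    one added argument in one docstring line.
    """
    ...
```

Tiny, yes, but it's exactly the kind of change a maintainer can review and merge in thirty seconds.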
3. Craft an Impeccable Pull Request
This is where many potentially great contributions fall short. Your PR isn’t just code; it’s a proposal, a narrative. Think of it as selling your idea to the maintainers.
a. Clear, Concise Title
Summarize the essence of your PR. Good: `Fix: NaN loss on AdamW with AMP`. Bad: `Update optimizer stuff`.
b. Detailed Description
This is crucial. Explain:
- What problem does this PR solve? (e.g., “The current `x_y_z` function miscalculates `foo` in edge cases, leading to `bar` behavior.”)
- Why is this solution the best approach? (e.g., “I considered `approach A` but it introduced `overhead`, and `approach B` had `compatibility issues`. This approach uses `C` which is `efficient` and `standard`.”)
- How did you test it? (e.g., “Added `test_case_for_x_y_z` which now passes. Verified with `dataset_D` on `GPU_E` for `F` epochs.”)
- Any potential side effects or considerations? (e.g., “This might slightly increase memory usage for very large inputs, but the accuracy gain is significant.”)
Here’s a simplified example of a good PR description structure:
**Summary:** Fixes an issue where `ModelName`'s attention mask was incorrectly applied, leading to suboptimal performance on sequences with padding.
**Problem:**
When using `ModelName` with batched sequences of varying lengths and padding tokens, the attention mask was not correctly propagated through `LayerX`, resulting in attention being applied to padding tokens. This manifested as lower accuracy during fine-tuning on `DatasetY`.
**Solution:**
Modified `ModelName.forward()` in `src/model_name/modeling.py` to ensure the attention mask is explicitly passed to `LayerX.forward()`. Specifically, added `attention_mask=attention_mask` to the call site.
**Testing:**
1. Added a new unit test: `test_attention_mask_propagation` in `tests/test_model_name.py`. This test constructs a padded batch and asserts that attention weights for padding tokens are zero.
2. Verified fix by fine-tuning `ModelName` on `DatasetY` for 3 epochs. Previous accuracy was 82.1%, now consistently achieves 85.3% on the validation set.
**Potential Impact:**
No known side effects. This change aligns with the expected behavior of attention mechanisms and is backward compatible.
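For reference, the unit test mentioned in that description might look roughly like this. This is a toy, self-contained version: `masked_attention_weights` is a stand-in for whatever `LayerX` actually computes, not the real `ModelName` API.

```python
import torch
import torch.nn.functional as F

def masked_attention_weights(q, k, attention_mask):
    """Scaled dot-product attention weights with padding keys masked out.

    attention_mask: (batch, seq) tensor, 1 for real tokens, 0 for padding.
    """
    scores = q @ k.transpose(-2, -1) / k.shape[-1] ** 0.5  # (batch, seq, seq)
    # Set scores for padding *keys* to -inf so softmax assigns them weight 0.
    scores = scores.masked_fill(attention_mask[:, None, :] == 0, float("-inf"))
    return F.softmax(scores, dim=-1)

def test_attention_mask_propagation():
    torch.manual_seed(0)
    q = torch.randn(2, 5, 8)
    k = torch.randn(2, 5, 8)
    mask = torch.tensor([[1, 1, 1, 0, 0],   # batch 0: last two tokens are padding
                         [1, 1, 1, 1, 1]])  # batch 1: no padding
    w = masked_attention_weights(q, k, mask)
    # No query may attend to a padding token.
    assert torch.allclose(w[0, :, 3:], torch.zeros(5, 2))
    # Each query's weights still sum to 1 over the remaining keys.
    assert torch.allclose(w.sum(-1), torch.ones(2, 5))
```

A test like this is cheap to run in CI, needs no dataset or GPU, and pins down the exact invariant the fix restores.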
c. Adhere to Code Style and Conventions
Use linters, formatters (like Black or Prettier), and follow the project’s established coding style. Nothing screams “I didn’t read the docs” more than a PR with wildly inconsistent formatting.
d. Write Clear and Thorough Tests
For AI projects, this is non-negotiable. If you’re adding a feature, add tests for it. If you’re fixing a bug, add a test that reproduces the bug and then passes with your fix. Automated tests are a maintainer’s best friend; they provide confidence that your changes don’t break existing functionality and will continue to work in the future.
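The reproduce-then-fix pattern is simple in practice. Here's a minimal sketch with an invented bug (the function and the issue it references are hypothetical):

```python
def moving_average(xs, window):
    """Simple moving average over a list of floats.

    The original bug divided by `window` even for the short leading
    windows, deflating the first few values.
    """
    out = []
    for i in range(len(xs)):
        lo = max(0, i - window + 1)
        chunk = xs[lo:i + 1]
        out.append(sum(chunk) / len(chunk))  # fix: divide by actual chunk size
    return out

def test_short_leading_window_regression():
    # Reproduces the (hypothetical) issue report: the first value came back
    # as 1/3 instead of 1.0. This test failed before the fix, passes after.
    assert moving_average([1.0, 1.0, 1.0], window=3) == [1.0, 1.0, 1.0]
```

The test encodes the bug report itself, so the regression can never silently return without CI catching it.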
4. Be Responsive and Patient
Once you submit your PR, be prepared for feedback. Maintainers might ask for clarification, suggest alternative approaches, or point out issues. Respond promptly, politely, and incorporate their feedback. It’s a collaborative process. If it takes a few days for them to respond, that’s normal. They’re busy people!
I once had a PR sit for nearly two months before it was reviewed. I pinged it gently once after a month, but then just let it be. When it finally got reviewed, the feedback was super helpful, and it was merged a week later. Patience is a virtue here.
5. Consider the “Why” Before the “How”
Sometimes, what you think is a brilliant technical solution might not align with the project’s broader goals or design philosophy. Before embarking on a complex implementation, consider opening an issue or a discussion thread first. Clearly articulate the problem you’re trying to solve and propose a high-level solution. This allows maintainers to provide guidance *before* you’ve invested hours in coding something that might get rejected.
This is especially true for new features or significant refactors. It’s a way of saying, “Hey, I think this would be a valuable addition/improvement. Are you open to discussing how it might fit in?”
Actionable Takeaways
- Start small: Build credibility with minor contributions before tackling major features.
- Read everything: Contribution guidelines, existing issues, and design docs are your roadmap.
- Communicate clearly: Your PR description is as important as your code. Explain the problem, your solution, and how you tested it.
- Test thoroughly: For AI projects, solid tests are non-negotiable proof of concept and stability.
- Be patient and receptive: Open-source is a marathon, not a sprint. Feedback is a gift.
- Think before you code: For larger ideas, open a discussion issue first to gauge interest and alignment.
Contributing to open-source AI is incredibly rewarding. It’s how we collectively push the boundaries of what’s possible, debug the impossible, and build the tools that enable the next generation of AI developers. By being thoughtful and strategic in our contributions, we not only increase our chances of getting our code merged but also foster a healthier, more collaborative open-source ecosystem. Now, go forth and contribute!
Originally published: March 15, 2026