GitHub Copilot available for JetBrains and Neovim (github.com/github)
540 points by orph on Oct 27, 2021 | 431 comments



I’ve never understood the value proposition for Copilot.

In terms of difficulty, writing code is maybe on average a two out of ten.

On average, maintaining code you wrote recently is probably a three out of ten in terms of difficulty, and maintaining code somebody else wrote or code from a long time ago probably rises to around a five out of ten.

Debugging misbehaving code is probably a seven out of ten or higher.

GitHub Copilot is optimising the part of the process that was already the easiest, and makes the other parts harder because it moves you from the “I wrote this” path to the “somebody else wrote this” path.

Even during the initial write, it changes the writing process from programming (which is easy) to understanding somebody else’s code to ensure that it’s right before accepting the suggestion (which is much less easy). I just don’t understand how this is a net time/energy savings?


Did you try it? Because I've been using it for weeks and it makes me read these types of comments as "I don't understand the value of the internet" or "what's the purpose of owning a phone".

It's night and day if you have it enabled or not. There's just no question about the value proposition once you start using it.

I mean, you can tell comments here from people who have actually been using it, and people who have not tried it.


> Because I've been using it for weeks and it makes me read these types of comments as "I don't understand the value of the internet" or "what's the purpose of owning a phone".

And yet apart from making this very inaccurate comparison you haven't made any argument for why such a thing as Copilot would be useful to anyone. How do you personally find Copilot useful? And why do you think someone whose job demands more than copy/pasting boilerplate code should try Copilot? The onus is on you to convince the skeptics.


I think the current Copilot is less useful for a person who already knows the language and libraries inside out.

Right now I see more use for people who understand the language and libraries, but are frequently Googling "how do I do xyz in P" (because they can't recall certain things).


Bouncing around languages and ecosystems is pretty common, isn't it?


For the average career developer, less common than being stuck with the same legacy code base and language/framework for years.


Yep. A lot of the negative comments seem to come from people who haven’t worked on any new technology/language/framework in years.


Only within startups. Larger corporations have stable toolchains with lifetimes measured in years.


Based on my experience at a very large corporation, not the case. I've had to work on more languages within a year here than any other job I've had.


Almost everyone does search like that; it's not about doing something, but about finding the best and easiest way to do it.


If you mainly deal with 1-3 languages on a daily basis that you have mastered, you don't routinely search for "How do I do xyz in P". Maybe if you're a junior or intermediate developer, or have a poor memory. But doing that frequently is a clear indicator that mastery has yet to be achieved.

It's not wrong or bad to search for help, but it doesn't indicate mastery of the language you're using.


I would say if you work in a narrow domain with a single language then yes, you might not need much searching.

However if you routinely switch among 3-5 languages you will get confused by naming and idiomatic approaches.

Ex: 1. Was it toUpper or upper or upperCase?

2. What was the most idiomatic way to filter some collection?

3. Was it justify-content and align-items or vice versa?

A good IDE will generally solve the first.

Presumably Copilot should help with the second by supplanting search.

For the third, I do hope Copilot helps there too.

I would say not remembering the name of some method is not an indication of a lack of mastery.

Even creators of popular libraries and programming languages have admitted they will use search to refresh their memory.
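
To make the first two concrete, this is roughly the kind of trivial divergence I mean (a sketch; the non-Python names are from memory):

    xs = [-2, 1, 3]
    "hello".upper()           # Java/JavaScript: "hello".toUpperCase(); Go: strings.ToUpper("hello")
    [x for x in xs if x > 0]  # JavaScript: xs.filter(x => x > 0); Java: xs.stream().filter(x -> x > 0).toList()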


Knowing a language or a tool doesn't mean you will always know the best or smartest way to do something. This is not necessarily a test of your programming ability. And best practice is ever changing: almost every language or tool is continually improving, so best practice keeps evolving with it.

Secondly, you don't necessarily need to master a language or tool for every kind of work. You can just choose to learn as you go, in which case knowing how to search for the most effective way to do something is very useful.


> The onus is on you to convince the skeptics.

Is it?

Look, I don't care at all if you use copilot; you can use notepad to write your code if that floats your boat; do whatever you want.

What the parent post said is: Copilot is useful; it helps you write code with autocomplete suggestions.

If you think that you don't get productivity gains from an IDE or you're in the 'no IDE makes you more hardcore and better programmer so never program with an IDE' camp, we just have to agree to disagree.

So, I have no interest in that conversation.

...

However, there is a more interesting conversation here we can have:

Given that you have an IDE and use autocomplete:

1) Does Copilot give suggestions that are meaningfully useful?

Yes. I honestly can't give you a better answer than that. Yes, it does, it's quite good.

If you don't believe me, try it.

2) Is it better than regular autocomplete?

Look, forget the 'Look, I typed 'process user form and display on UI' and it autocompleted a whole application for me!!' hype.

That's stupid and that's not how it works.

It's an autocomplete. It can autocomplete large chunks, but they're generally rubbish. ...but it does two very interesting things:

- It suggests things that are contextually correct.

For example: even though it's rubbish at C++ syntax, it generates valid UPROPERTY and UFUNCTION blocks for Unreal code. If I write a `for (y = 0;...` on an array, it generates the associated `for (x = 0;...` when I'm iterating over a 2D array.

If I have a function which takes a pointer like:

> UFvContainerWidget* UFvContainerWidget::ForEquipmentDef(UFvEquipmentDef* EquipmentDef, bool& IsValid)

I press enter and it pops up:

    if (!EquipmentDef) {
      IsValid = false; <--- WTF! 
      return nullptr;
    }
Sure, it's a similar pattern to other code in related files, but still. This surprised me. I've never encountered an autocomplete that does that before.

Sometimes the suggestions don't make sense, and the larger the chunk, the less sense they make.

...but the suggestions that do make sense, make you regret not having it when you don't have it.

Like... regular autocomplete.

It's just a tool; it works very well at small scale autocomplete tasks.

- It can suggest comments from code as well as code from comments.

Literally, I can go above a function and type "/*" and it'll suggest a comment.

These don't always make sense, but often they're pretty close, and it saves me 20 seconds typing.

You have to carefully read these comments, because they tend to get random crap in them, but once again, for short comments it's not bad.

Again... it's surprisingly good. Not perfect. It doesn't write your comments for you... but, it's easy to get into the habit of getting it to autocomplete "Returns null if the object is invalid" for you instead of typing it out.

3) Should I use it?

Look, I literally do not care if you do or don't.

What I take issue with is people saying 'it has no value'.

Does autocomplete have a value? Then this has a value.

Saying it has no value is just trolling.

Is it worth the cost?

Well, it's free to use right now.... so, well, you can't beat free right? :)

Longer term, would I pay for it?

Probably (** Personal opinion only: Maybe... you should try it and decide for yourself?)


I'm probably going to use it, but isn't it valuable to be a bit skeptical and question what the long-term effects may be?

Over the past decades we went from snail mail to email to instant messaging, each step made it easier to write text to a person. Today, we are writing so many messages to each other that people have started arguing for less instant messaging and less email. Mainly because distractions and frequent context shifts allegedly reduce productivity and happiness.

With Copilot, we have a similar evolution where writing code becomes easier. Could this result in people writing more and more code since it takes less effort? What would this do to the developer ecosystem in the long term? Maybe code reviews will take longer because there is more code, and because it is more likely for junior developers to introduce bugs using Copilot code. Maybe this results in more bugs slipping through code reviews and into production, and eventually lower productivity and happiness, since more time is spent stressfully fixing production errors. I can't predict the future, but I do think it's valuable to ask these questions before it's too late.


> Literally, I can go above a function and type "/*" and it'll suggest a comment.

Finally, we're getting somewhere with this AI stuff....


> Well, it's free to use right now....

From copilot "additional telemetry"

> If you use GitHub Copilot, the GitHub Copilot extension/plugin will collect usage information about events generated by interacting with the integrated development environment (IDE). These events include GitHub Copilot performance, features used, and suggestions accepted, modified and accepted, or dismissed. This information may include personal data, including your User Personal Information

So, not really free is it?


> Should I use it?

> Look, I literally do not care if you do or don't.

> What I take issue with is people saying 'it has no value'.


It depends on semantics and your interpretation of value.

In the eyes of most people, free == 'something that I don't have to pay for through my bank account or other means', as opposed to caring about analytics, telemetry, etc.

At least AFAIK that's the common usage and what almost everyone means, though it's definitely worth it to talk in more detail about what hides under that term most of the time!


> suggestions accepted, modified and accepted, or dismissed. This information may include personal data, including your User Personal Information

Seriously wtf, my legal department will have a heart attack if they read this.


Yes it’s free


That sounds about as useful as Tesla's "auto-pilot" - good when it works but you have to always pay attention that it's not trying to kill you (or your code in this case).

A bad proposition for most people, because you can't trust it.


The whole trust thing is an interesting topic. I thought the same thing, but then I got Open Pilot. I would feel comfortable falling asleep with that on at this point. It takes time though: around 400 miles for me before I "fully" trusted it.


Sounds like you haven’t tried it :)


First, why would I need to argue this? Just try it and see if it’s useful for you. I know people who don’t use IDEs or don’t use syntax highlighting and it doesn’t seem to bother them much, so who knows.

Second, I’ve actually answered elsewhere in this thread.


You can sometimes just tap enter inside a model or another file (in a framework like Laravel, for example), and it'll literally guess the entire function. At most, if you just write a comment about what the function should do, or name the function something sensible, it'll get the whole function right maybe 58% of the time, and when it's wrong there are other options to choose from, and one might get you 90% of the way to the end goal and need just a few modifications.

I've used Kite and TabNine, and loved TabNine, but this is something different and way more like magic. I can't explain how; it just feels like it's reading my brain as I type.


I've only used it a bit and it's like glorified autocompletion. It messes up a lot in some ways too, often suggesting getFoo when in Kotlin we just use foo.

It is really fun to see it sort of understand your code though, and every once in a while it does a smart suggestion that could've taken a while for me to figure out myself.


It's certainly a good autocomplete but it can't be used for anything more complex because you just can't trust it. Every now and then it produces an entire function filled with complexity and I think "This looks right but I would need to independently verify everything" and at that point it's easier to write from scratch.


Whenever that happens it's actually a good foundation, as it still does a lot of the boilerplate.


Are you worried about copyright issues from the code it produces?


It only spits out functions at most 10 lines long. There is very little that could actually be copyrighted in a 10-line block. And most of those suggestions are for incredibly generic problems, like finding the distance between two coords.


This is a cloud-based code suggestion platform. No corporation with a solid secrecy policy will allow you to use it. For private use it will cost money, so I prefer just learning the field. What precisely convinced you?


Plenty of corporations trust GitHub itself with their code, why should trusting copilot be any different?


Did you comment on the wrong post?


There are so many people in here saying "did you try it?" - and I signed up on the waiting list the day it was announced and still don't have access.


Was in the same boat! Just got approved literally 2 days ago. And I have tons of activity...


Could not have said it any better. Enabled it a couple of days ago and I was flabbergasted at the results it produced for me.


I don't see it either. The context switch between being in the zone/flow and writing the exact code I'm thinking of to suddenly reviewing blocks of foreign and quite possibly wrong code seems like a net negative value proposition. I can't even get autocorrect on my phone to do the right thing half the time.

Writing code is easy. Architecture, refactoring, and solving business problems are the hard parts of the job.

Writing new code is also generally the most rewarding aspect of the job. Co-pilot promises to turn that into just another unrewarding chore, like slinging 3rd party libraries together.


It's very useful for certain types of tasks that are inherently repetitive. It will construct debug strings very easily, for example.

E.g. type `pr` in a function that has variables x and t,

and it may predict...

`print(f"The value of 'x' was {x} at time {t:.2f}s")`


There is some code in your life that you wish someone else could write for you.

Because such code is dumb, tedious and joyless. I sometimes have to bite my lip to convince myself that writing it is not a waste of time, because people demand it, but I hate it to my core.

Copilot is that unfortunate boy who has to do all that manual work. It is the ultimate boilerplate mixer.

It is not going to write all your code for you; if your goal is to have it THINK for you, then you are due for disappointment. But it can help you be a more efficient and slightly happier programmer.


Copilot also optimizes for speed to a degree. It's akin to advanced auto-complete. IntelliJ auto-completion is great. As much as it pains me to say this, I don't think I would be as effective writing Java in Vim as I am with IntelliJ. The key differentiator is the auto-complete speed. Copilot, I feel, is just auto-complete on steroids. It may not be perfect yet, but there is definitely a problem it solves.


Have you used it? My experience was quite atrocious. Copilot is not auto complete. It’s nonsense. I attempted to use it continuously for three weeks. I tried because I know someone who built it and I wanted to give them the benefit of the doubt.

It never prompted me with any code that was useful. It only ever slowed me down and caused me frustration. It’s nothing like Intellisense. It’s just trash.


I pretty much use it every day at this point and I notice when it is disabled.

> It’s just trash

I have a hard time relating to this kind of experience considering how useful it has been for me. What language are you writing in btw? When I use it for OCaml it's not that useful, perhaps because there isn't as much OCaml code to learn from :D


A lot of TS React or Node


Could you cite an actual specific example? I have a difficult time believing your assertion that it provided absolutely no value at any point - it just sounds like a baseless ad hominem attack.

I find that it's fantastic for TypeScript and JavaScript, allowing me to flesh out basic data object containers, class definitions, etc. extremely quickly.

If I don't remember the exact parameters you have to pass into a certain npm package's methods, it will usually prompt me and help me complete the call without having to context switch to a browser and look it up.


https://johnaaronnelson.com/i-cant-anymore-with-copilot/

TLDR; the subtleness of its wrongness destroys my ability to follow my train of thought. I always had to take myself out of my train of thought and evaluate the correctness of the suggestion instead of just writing.

For simple things, intellisense, autocomplete and snippets are far more effective.

For anything more complex, I already know what I want to write.

For exploratory stuff I RTFM

copilot was ineffective at every level for me.


It's helped me out quite a bit.

When I'm writing Angular code, it often fills in the correct boilerplate, and it is especially helpful when writing unit tests. I'm also quite surprised when it autocompletes various filter functions.

It isn't perfect, but it's been helpful filling in the mundane, simple stuff.


Sorry to tell you, but if you’re constantly, or even regularly, writing this much boilerplate code, then you probably need to change how you write code. Maybe try a different framework.


Try tabnine? It doesn’t generate so much nonsense because it’s all generated based on only your own codebase.


Tried it a bit. Not useful for me.


I pair program with a guy who completely refuses to use the keyboard unless there is no other option. He uses the mouse to cut/copy/paste everything possible. He is not handicapped. It is frustrating for me to pair program with him because of that.

He will spend an extra 3-5 seconds using his mouse in order to avoid typing.

Perhaps he is the target market.


Then making it work with Neovim seems like an odd choice.


The only drawback is....constraints. Autocomplete constrains suggestions to those that are valid or at least valid-adjacent (like it'll use something but auto-import it to make it valid, etc). Copilot fails miserably here and I don't yet see it improving anytime soon. Maybe it will, and if it does, it'll be great. But I won't hold my breath for it.


I mean, why only pick vim or IntelliJ? I get them both with IntelliJ and the plugin.


> value proposition for Copilot.

now, instead of copying off stackoverflow, it's gonna be off copilot. It will enable a lot more people to code who otherwise would not. Whether this is a good outcome or not...


This is even another example of just optimising for the easy bit.

I could hire 50 juniors that can code tomorrow if I wanted to. But even with an unlimited budget, finding good devs that can make it through a 2 year project without coming out of it with a big ball of unmaintainable shit is difficult.

The gulf from beginner to expert is already big, and the more crutches you use early on, the bigger it's going to get. There's a lot of people that wash out of the industry before they reach the point of being able to comfortably build good software (and be solely responsible for it).

I think copilot is another item in a long list of things that's good for big businesses (who optimise heavily for getting passable results with 1,000 mediocre devs instead of 50 good ones) and terrible for individuals in the long run.


Sooner than later using copilot will be an interview question and it will trigger a big red flag if the company cares about talent.


Your comment hides some truth! Imagine coding today without stackoverflow. Possible, but you'd lose so much time looking for simple answers.


The more experience I gain, the less I use SO, and the more I just go to the source or read the docs.

With googling for SO answers I have to parse the question, find a modern answer (because the accepted one is 10 years old and won’t work), parse that and adapt it to my problem. With documentation I just search for what I need and go straight to solving my problem and I’ve never felt more productive.

I feel like people new to programming focus too much on a specific problem at hand instead of learning the problem solving themselves. I wish I would’ve learned to figure out the issue myself from the start.


I feel like jobs where you can constantly rely on your already accrued knowledge are rare. Or maybe it's just me, and I work in fields where I have to learn new technologies constantly?

Recently I’ve been doing a lot of OCaml and it’s been tough as there’s very little on stackoverflow. Every time I have a question I have to spend a lot of effort looking for the answer instead of relying on someone before me having the same problem and posting the answer online.


> programming focus too much on a specific problem at hand instead of learning the problem solving themselves

you got the importance and focus wrong. People don't care about problem-solving skills - they care about the specific problem being solved. That's why they pay someone to fix it.

They don't want to pay someone to "learn" problem solving (because the stakeholders don't care).

Stack Overflow immensely helped this sort of use-case - maybe to the detriment of quality - but I cannot deny that Copilot is going to accelerate it.


Same. I was wanting to learn more about ActivityPub recently, and after reading the first two web search results, I remembered: this'll probably be easier if I just read parts of the W3 spec (and it was).


My use of stackoverflow has vastly faded over time. I only ever go there to look for hints on ways to do something better, better practice, newer ways or a simpler way.

There are more and more wrong answers posted by what looks like the same people who have MVP status on the Apple, Microsoft and Google forums and just whore for kudos points. I can't understand the motivation to dilute something of value.


And you'll get the same "works for me" results as you would from SO. I put my thoughts down a while back: https://bostik.iki.fi/aivoituksia/random/minimum-viable-copy...


>...now, instead of copying off stackoverflow, it's gonna be off copilot.

Eventually, I'm seeing another breed of SO questions: making sense of Copilot-suggested fragments and seeking reassurance or alternatives... Then possibly the copying off, just as now.


By the same logic you've presented, what's the value proposition of the plain-old auto-completion? What's the value proposition of a slick editor? All you need is the built-in notepad and a debugger.

Speaking from my personal experience, I usually write code in TDD style, in which I test the properties of the software I desire upfront, then make it pass with a minimal amount of effort. When I see there's a need for refactoring, I refactor. And I repeat this process until it is done.

The three parts take roughly the same portion of time, and when I'm writing tests, I'm thinking about the functionality and value of the software. When I'm refactoring I'm thinking about the design. When I'm writing the implementation initially, I want it to Just Work™ in the first place, and I find Copilot is great for this matter: why not delegate the boring part to the machine?


You know, perhaps this is tangential to the point you're making at best, but I still couldn't help but notice:

> The three parts take roughly the same portion of time, and when I'm writing tests

that bit, and I have some strong feelings about it. At my current dayjob, writing tests (if it were even done for all code) would easily take anywhere between 50% and 75% of the total development time.

I wish things were easy enough for writing test code not to be a total slog, but sadly there are too many factors in place:

  - what should the test class be annotated with, and which bits of the Spring context (Java) will get started with it
  - I can't test the DB, because the tests don't have a local one with 100% automated migrations, nor an in-memory one because of the need to use Oracle, so I need to prevent it from ever being called
  - that said, the logic that I need to test involves at least 5 to 10 different service calls, which themselves use another 5 to 20 DB mappers (myBatis) and possibly dozens of different DB calls
  - and when I finally figure out what I want to test, the logic for mocking will definitely fail the first time due to Mockito idiosyncrasies
  - after that's been resolved, I'll probably need to stub out a whole bunch of fake DB calls that return deeply nested data structures
  - of course, I still need all of this to make sense, since the DB is full of EAV and OTLT patterns (https://tonyandrews.blogspot.com/2004/10/otlt-and-eav-two-big-design-mistakes.html) as opposed to proper foreign keys (instead you end up with something like target_table and target_table_row_id, except named way worse and not containing a table name but some enum that's stored in the app, so you can't figure out how everything works without looking through both)
  - and once I've finally mocked all of the service calls, DB calls and data initialization, there's also validation logic that does its own service calls, which may or may not be the same, thus doubling the work
  - of course, the validators are initialized based on reflection and target types, such as EntityValidator being injected while actually being one of ~100 supported subclasses, which may or may not be the ones you expect due to years of cruft; you can't just ctrl+click to open the definition, since that opens the superclass, not the subclass
  - and once all of that works, you have to hope that the 95% of the test code that vaguely corresponds to what the application would actually be doing won't fail at any number of points, just so you can do one assertion
I'm not quite sure how things can get that bad, or how people can architect systems to be coupled like that in the first place, but at the conclusion of my quasi-rant I'd like to suggest that many of the systems out there definitely aren't easily testable, or testable at all.

That said, it's nice that at least your workflow works out like that!


Have you read Working Effectively with Legacy Code?

It's transformative in situations like this, it has a bunch of recipes for solving these kinds of problems.

While I don't use Java or C++, this book has probably been the most useful to me in working with larger bodies of code.


While the book is indeed good, it's pretty hard to do anything to improve that particular codebase, because there are developers actively introducing more and more of the problematic patterns and practices even as I write this.

To them it isn't "legacy code" but just "code", and attempting to offer alternatives either earns you blank stares or concerns about anything new causing inconsistencies with the old (which is a valid concern, but doesn't help when the supposedly consistent code is unusable).

To me it feels like it's also a social problem, not just a technical one, and if your hands are essentially tied in that situation and you fail to get the other devs and managers on board, then you'll simply have to either be very patient or let your legs do the work instead.


Thanks for sharing this. I feel you, because I have been working on a similar project, only slightly better; it's still painful for me. I wrote a comment last month [0] that is more or less related to what you've said. Basically, you want to write fewer tests, ones that really matter, while the infrastructure should be fast and parallelizable.

Sadly it's easier said than done, since it's not an easy thing to fix in an existing system. We've spent quite some time improving things to ease the pain of writing tests; it was getting better, but it will never reach the level it would have been at if we had been aware of this problem in the first place - there are tens of thousands of tests and we cannot rewrite them all.

I'm not too familiar with your tech stack. But there are two things you mentioned that are especially tricky to handle for testing: DB and service calls.

For DB, there are typically two ways to handle it: Use real DB, or mock it.

A real DB makes people more confident, and you don't need to mock too many things. The problem is it can be slow and not parallelizable, or worse, like in your case, there may be no adequate environment at all. We had automated migrations, but the tests ran against the SQL Server on the same machine, so they were not parallelizable and took more than a day to run on a single machine. On CI there are tens of machines, but it still takes hours to finish. In the end, we generalized things a little bit and used SQLite for testing in a parallel manner. (Many people advise against this because it's different from production, but the tradeoff really saved us.) A more ideal approach is to have SQL sandboxing like Ecto (written in Elixir). Another is to have an in-memory lib that is close to the DB; for example, the ORM Entity Framework has an in-memory implementation, which is extremely handy because it's written in C# itself.

If there's no way to leverage a real DB, you have to mock it. One thing that might help you is to leverage the Inversion of Control pattern for DB access; there are many doctrines - DDD repositories, Hexagonal, Clean Architecture - but they're essentially similar on this point. In this way, you'll have a clean layer to mock, and you can hide patterns like EAV under those modules. As you use them enough, they will evolve, and helpers will emerge that simplify the mocking process. Based on your description, the best bet, I would say, is to evolve in this direction if there's no hope of using a real DB, as you can tuck as much domain logic as possible into the "core" without touching any of the infrastructure. The infrastructure tests can then be very simple and generic.
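
As a rough sketch of that kind of clean, mockable layer (in Python rather than Java for brevity, and all names are hypothetical):

    # Domain logic depends on an abstract repository rather than on DB mappers,
    # so a test can swap in an in-memory fake instead of stubbing dozens of DB calls.
    class OrderRepository:
        def find(self, order_id): raise NotImplementedError
        def save(self, order): raise NotImplementedError

    class InMemoryOrderRepository(OrderRepository):
        def __init__(self):
            self._orders = {}
        def find(self, order_id):
            return self._orders.get(order_id)
        def save(self, order):
            self._orders[order["id"]] = order

    def apply_discount(repo, order_id, pct):
        # Pure domain logic: tests exercise it against the in-memory fake.
        order = repo.find(order_id)
        order["total"] *= (1 - pct)
        repo.save(order)
        return order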

For service calls, the obvious thing is to mock those calls. The not-so-obvious thing is to have well-defined service boundaries in the first place. I cannot stress this enough. When people fail to do this, they spend a lot of time mocking services, while at the same time feeling they've tested nothing, because most things are mocked. Microservices have been getting too much hype over the years, but very few people pay enough attention to how to define service boundaries. The ideal microservice should be mostly independent, only occasionally calling others. DDD strategic design is a great tool for designing good service boundaries (while DDD tactical design is yet more hype, just like how people care more about Jira than real Agile, making good things toxic). We're still struggling with this, because refactoring microservices is substantially harder than refactoring code within a service, but we do try to avoid more mistakes by carefully designing bounded contexts across the system.

With that said, when the service boundaries are well-defined, and if you have things like SQL sandboxing, it's a breeze to test things, because most of the data you're testing against is in the same service's DB, and there are very few service calls that need to be mocked.

[0] https://news.ycombinator.com/item?id=28642506#28679372


The value prop for things like these is always the same: for the widely - and accurately, though that's irrelevant to my point - lambasted initial release to start the ten-year journey toward an eventual product that will make people link to comments like these the way they link to the original Dropbox Show HN post.

There are levels of ease of which we have not yet dreamed, especially in the realm of information manipulation.


I mean, I'm obviously intensely skeptical that such a thing will happen at all, much less within ten years.

But I guess we'll find out eventually! And if mine does become the "640k ought to be enough for anybody" quote of this decade, then I suppose there are worse kinds of fame.


I think professional software devs won’t get much value from copilot.

OpenAI's demo from a few months back showed it as a sort of bridge to convert natural-language instructions into API calls. E.g. converting "make all the headings bold" into calls to a Word doc API.


It can still do that. Write a small comment before writing CSS and see it go!


I feel like 10 years from now we will look at threads like this and laugh, akin to 64kb being enough.


640kb, not 64kb, is the value used in that apocryphal quote. A quick google search would have shown that. I wonder if there's a google copilot in the works for social media posts.


We do look 40 years back and laugh at people thinking that artificial intelligence will be easy.


Exactly, I'm already laughing reading the comments here considering I've been using copilot for weeks and it's a game changer


>>> in terms of difficulty, writing code is maybe on average a two out of ten

Imagine what's possible when that difficulty level shrinks to 0.0001 / 10

Github Copilot is a "code synthesizer"

Xmas time takes me back to one of the most popular "toys" of all time: the Casio SK-1. Music sampler for the masses.

It's like that ;)


This is for outsourcing farms to dump even more garbage code on ignorant founders for pennies.


I guess it really depends on what you are working on.

If you are writing non-trivial algorithms or working on projects which require delicate handling of things, then Copilot is most likely going to mess up.

But if you are working on frontend code or backend CRUD, which is usually quite repetitive, then Copilot can be helpful.


It's meant to be a souped-up autocomplete. You don't quite remember how to do a common thing, and instead of having to go look it up, the IDE suggests it for you and you can keep doing what you're doing. A bunch of small instances of that can save you lots of time.


> I just don’t understand how this is a net time/energy savings?

At the end of the day it's all about trust. Do you trust code you find on SO/Copilot to be good enough for your use case?

In my case I do not trust SO code. Whenever I use SO, if I find a snippet that seems to be the solution I'm looking for, I copy-paste the snippet into my IDE, read through it carefully, rename variables as needed, handle edge cases, remove unused code, etc. Any code solution I find on SO gives me the "starting" kick, which is about 10% of the total effort of writing the code from scratch. The remaining 90% (understanding the code that is being committed) cannot simply go away. I do not expect Copilot to make much of a difference.


> In terms of difficulty, writing code is maybe on average a two out of ten.

It's not about difficulty, but time. Not the same thing. Easy can still be time consuming.

Have you seen how much time the average developer spends on Stack Overflow and googling for answers?


I also fear that Copilot will be teaching anti-patterns.

Just tried something really simple: def is_palindrome

Copilot suggestion was

  def is_palindrome(word):
      if word == word[::-1]:
          return True
      else:
          return False
facepalm

So, good for a technically correct solution, but still...

This is an anti-pattern, I think, in pretty much any language that I know of, and something that about half of my beginning students try when they learn about branching.

UPDATE: more howlers in the same vein

  def haystack_contains_needle(haystack, needle):
      if needle in haystack:
          return True
      else:
          return False
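
(The idiomatic version of this one is simply `return needle in haystack`.)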


UPDATE: The above howlers were in IntelliJ...

However in Visual Studio Code on a different computer, I got much better idiomatic suggestions.

such as

  def is_palindrome(word):
      return word == word[::-1]
Very puzzling.


It's not so much for the parts where you think hard and implement that perfect feature. It's insanely useful when you have to make sweeping, pesky changes.

Like pulling values from a config dict and initializing a bunch of attributes on a class? Or setting up a test case similar to one you already have, but with different values? Or cleaning values from a form? It's not a bulk edit, but it's also not thoughtful code-writing. It's monotonous and mundane. And lots of people do a lot of this on a daily basis.

Copilot makes this a cinch.
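
To illustrate the shape of that tedium (hypothetical names; exactly the kind of thing where it fills in the rest after you've written the first line or two):

    # Pulling values from a config dict into attributes, line after line
    class Server:
        def __init__(self, config):
            self.host = config["host"]
            self.port = config["port"]
            self.timeout = config["timeout"]
            self.max_retries = config["max_retries"]
            self.log_level = config["log_level"]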


you're only thinking about the 'write a comment, get a block of code' feature. It also has autocomplete/predictive functionality that speeds up coding quite a bit when it works.


>GitHub Copilot is optimising the part of the process that was already the easiest, and makes the other parts harder because it moves you from the “I wrote this” path to the “somebody else wrote this” path.

It is worth mentioning, I suppose, that from Copilot's point of view it is the inverse. Maybe a necessary or at least desirable step towards the inevitable 'Copilot debugger'.


I've had a lot of fun having it generate weird web pages without me writing any code.

Stuff like:

    // Add 100 divs to the DOM in random places
    ...
    // Randomize the color and text of all divs every 1 second
    ...
Other than that novelty, it can be genuinely useful if you think about it as a more intelligent autocomplete.


> In terms of difficulty, writing code is maybe on average a two out of ten.

I’ve never heard talented programmers say that


Yes, if you use copilot irresponsibly, you will end up with irresponsible code.


Maintaining code someone else wrote is much higher on your rating scale - probably at the top end - because it nearly always involves some debugging, and it's usually not obvious what footguns exist.


I've actually grown to like it; it seems like it's getting better with each use.


Maintaining code that you wrote rises to a nine out of ten. Debugging your code is a ten out of ten or higher.


You never understood something that only came out this year??? Never???


To make it faster to pump out Go, which is maybe >50% noise, with its poor language features and its anemic standard library that doesn't even have abs.


Copilot is crazy. The other day, I was writing a Python function that would call a Wikipedia API. I pulled from the internet an example of a GET request, and pasted it as a comment in my code.

  # sample call: https://en.wikipedia.org/w/api.php?action=query&format=json&list=geosearch&gscoord=37.7891838%7C-122.4033522&gsradius=10000&gslimit=100
Then I defined a variable,

  base_url = "https://en.wikipedia.org/w/api.php?"
Then, like magic, Copilot suggested all the remaining keys that would go in the query params. It even knew which params were to be kept as-is, and which ones would come from my previous code:

  action = "query"  # action=query
  format = "json"  # or xml
  lat = str(latitude.value)  # 37.7891838
  lon = str(longitude.value)  # -122.4033522
  gscoord = lat + "%7C" + lon
  ...
  api_path = base_url + "action=" + action + "&format=" + format + ... + "&gscoord=" + gscoord
As a guy who gets easily distracted while programming, Copilot saves me a lot of time and keeps me engaged with my work. I can only imagine what it'll look like 10 years from now.


This comment is accidentally the perfect example of why copilot is a horrific idea.

The old "just copy-paste from Stack Overflow" approach to development is satirised and ridiculed these days (despite being still in common practice I'm certain), because as we all know so well by now, an accepted answer on SO does not always equate to a correct answer. Yes, the SO guys & community do do their best to improve answer quality iteratively (wiki answers, etc.), but there's still a lot of bad answers, and even many of the "good" ones become outdated or don't keep up with modern best-practice (especially when it comes to security).

Omitting urlencoding isn't the biggest crime, but it is a pretty standard URL-building step, and the fact that a tool released this year is spitting out code that omits something so simple is fairly damning. It's also a micro-example of much larger errors Copilot will surely be responsible for. Missing url encoding can be an injection vector in many applications, even if it's not the most common risk, but miss encoding in other string-building ops and you've made your way into the OWASP Top 10.

The big difference between copilot and SO is there's no community engaging in an open & transparent iterative process to improve the quality of answers.


+1. URL encoding is also very relevant on the backend and has security implications; e.g., you want to be sure you are protecting against double-encoded URLs. Eliding details like URL encoding with Copilot is dangerous.
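
A minimal illustration of the failure mode (hypothetical values):

    from urllib.parse import urlencode

    title = "fish & chips"  # a value containing a reserved character
    # Naive concatenation: the '&' silently splits the value into a bogus extra parameter
    naive = "https://example.com/search?q=" + title
    # Encoded: the '&' becomes %26 and the value survives intact
    safe = "https://example.com/search?" + urlencode({"q": title})  # ...?q=fish+%26+chips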


For me APIs are actually one of the places it performs the worst. Copilot is like having an inexperienced yet annoyingly optimistic pair programmer. The code it generates appears conceivable in some hypothetical universe. No guarantee it's this one though.

Remember it doesn't actually know an API or how it should be used: it's putting things together to look like typical code. For me that has meant difficult to spot bugs like linking up incorrect variables from the rest of my code.

I wish it could integrate the first SO answer to a generated question, because I always end up there anyway having to fix things.


I think my experience has been sort of between you two. Maybe 1/3 times it's spot on. The rest of the time, there is some minor tweak I need to make (it gets a parameter or variable name wrong). I've yet to hit cases where the code it generates looks right but doesn't run as expected, thankfully.

I've only had it for about a week now but overall I'm happy with it. None of the code I'm writing is crazy cutting-edge stuff and in aggregate I'm sure it saves me more time than takes, including the time I spend reviewing and potentially changing the generated code.


That's a bummer. I just got whitelisted and was hoping it could save me time with some APIs where they only have code in X language or curl and I have to work backwards if I run into any issues.


Bit of a dodgy way to form query parameters though. Other than for a quick script.


Even for a quick script this worries me about Copilot: if it suggests this, then more people use it and think it's right, commit it, and then Copilot suggests it more - that's a bad feedback loop. At least on Stack Overflow you get someone commenting on why it's bad and showing how to use a dictionary instead.


I think they only pick starred repos, not truly in-the-wild code. That's not a guarantee of good code, but it's still a decent check.


I'm not against "copying" code. I just looked up "python build url query". The first link describes the `urllib.parse.urlencode` function, which takes a dict.

So I would build the query like so:

    from urllib.parse import urlencode
    urlencode({
        "action": "query",
        "format": "json",
        ...
        "gscoord": f"{str(latitude.value)}|{str(longitude.value)}",
    })
I think this is orders of magnitude clearer code. But that's subjective - a parameter that Copilot can't adjust for (although it could do better).


I'm surprised no one has suggested using `requests` considering how easy, safe and readable it is:

    >>> import requests, pprint
    >>> 
    >>> 
    >>> url = "https://en.wikipedia.org/w/api.php"
    >>> resp = requests.get(
    ...     url, 
    ...     params=dict(
    ...         action="query",
    ...         list="geosearch",
    ...         format="json",
    ...         gsradius=10000,
    ...         gscoord=f"{latitude.value}|{longitude.value}"
    ...     )
    ... )
    >>> 
    >>> pprint.pprint(resp.json())
    {'batchcomplete': '',
     'query': {'geosearch': [{'dist': 26.2,
                              'lat': 37.7868194444444,
                              'lon': -122.399905555556,
                              'ns': 0,
    ...


For what it's worth, Copilot can do it.

I typed the following prompt:

    def search_wikipedia(lat, lon):
        """
        use "requests" to do a geosearch on Wikipedia and pretty-print the resulting JSON
        """
And it completed it with:

    r = requests.get('https://en.wikipedia.org/w/api.php?action=query&list=geosearch&gsradius=10000&gscoord={0}|{1}&gslimit=20&format=json'.format(lat, lon))
    pprint.pprint(r.json())


It's like a junior dev who doesn't quit unnecessary code golfing. Somehow the AI is more comfortable with string-based URL manipulation, which is a straight anti-pattern.


Presumably because that's what it's seen in the training data. Remember, it doesn't care about what the code does, it's just doing a search for similar looking code.


That doesn't exactly do what the guy above you was talking about, though.


That's what the rest of the thread is complaining about, it's still slapping the strings in there with basic formatting. No different than the top level approach.


This. Code should be optimized for reading. I think this kind of code is OK for exploratory stuff, but it needs to be rewritten later.


Well. Code should be optimized first for correctness, and simple string concatenation will not work for URL params.


It'll certainly work, just seems sloppy.


Plenty of edge cases there (e.g. url encoding), but I don't want to preach to the choir and rabbit hole on this minor detail.


Speaking as a former pentester, this is a fine way to form query params in this specific case, if lat and long are floats.

They're the only data you can control, and unless they're strings, it's useless for exploitation. Even denormal floats / INF / NAN won't help achieve an objective.

I broadly agree with you, but people are pummeling Copilot for writing code that I saw hundreds of times. Yes, sometimes I was able to exploit some of that code. But the details matter.


But I would still never skip escaping the params, because you don't know how that code will change one day or where it will end up, and chances are you won't remember to fix it later if you don't fix it now.

We just had a major failure at work recently because someone decided not to decode URL params; their code worked fine for years because it never mattered… until it did.

Just do it right. It's so easy. Why risk a ton of headache in the future to save yourself a few seconds?


If the example code is everything Copilot generated, there's no guarantee that lat or long are floats; that seems to be an implementation detail left to the user.

Isn't that a pretty big risk though? Specifically, that people will use co-pilot recommendations "as-is" and give little thought to the actual workings of the recommendation?

After all, if you have to intimately understand the code it's recommending are you really saving that much time over vetting a Googled solution yourself?


How so? I'd prefer a proper structured library, is that what you mean? If so, the Copilot code actually seems not dodgy - because the author _started_ with `base = ...` , indicating that they were string formatting the params.

Or did you mean something else?


It probably does suck! I’m not very experienced, and I was just whipping up something quick to test if my random MSFS2020 mod idea could work.


Copilot may be the first of many computer dictionaries and thesauruses.

Oxford English Dictionary, for example, is a human version of defining language and a thesaurus is a completion engine.

Human language didn't suffer by having a dictionary and thesaurus. Computer language doesn't suffer either.


As a noob, what's the issue with this?


It's harder to read than other methods, and it doesn't encode the URL parameters, which means it potentially produces an invalid URL, and in some cases could lead to security problems (similar to SQL injection).


How so?


Concatenating strings for example. As shown, it's the query string equivalent of sql injection.

Use something like URLBuilder, or URIParams, or whatever your platform supports. Don't use string concatenation ever, if at all possible, and if not possible (wtf?), then at least escape strings.


I usually try to avoid working with URLs as bare strings like this, both for readability and correctness (URL encoding is tricky). With ‘requests’ you can do something like pass a dictionary of your query params and it takes care of forming the actual request URL.

https://docs.python-requests.org/en/latest/user/quickstart/#...


It's much safer (i.e. fewer degrees of freedom for bugs to appear) to use f-strings in situations like this one.

One correlated but ancillary benefit, is that there are fewer variables to simulate the state for in your brain, while you're reading the code. You don't have to wonder if a variable is going to change on you, in-between when it is initialized and when it is used.

It's safer still to use a library (e.g. urllib3) that does encoding for you (allowing you to omit magic strings like `"%7C"` from the logic of this function altogether).

Like GP said, it's very handy for one-off scripts or areas of your codebase where quality is "less important". I may be pedantic, but I wouldn't give this a pass in code review.


The code lacks context-sensitive escaping. Do this instead:

  import urllib.parse
  api_path = base_url + urllib.parse.urlencode({
    'action': action,
    'format': letThisBeVariable,
    ...
    'gscoord': str(latitude.value) + '|' + str(longitude.value)
  })
see: https://docs.python.org/3/library/urllib.parse.html#urllib.p...

Mantra: when inserting data into a context (like a URL), escape the data for that context.


It has no escape logic. Okay for scripts, as GP stated, very bad for production code.


the "nice" way of doing this would would be to create a list of your stringified arguments, mapped urlencoding over them, and then join them with the parameter separator. this ends up being resilient to someone adding something that ends up being incorrect, and makes explicit in the code what you're trying to do.


That's impressive. Discoverability and the varying quality of documentation are a big headache for new programmers or people engaging with an unfamiliar framework/API/library. I really like the comments pointing out alternatives (json or xml) and the static lat-long values.

One reason I've championed the development of visual programming (flow-based, node diagrams, etc.) is that while you don't want to compress a big complex program down into a single layer and have it become unreadable in the process, graphical methods are a great way for people to see what options they have available and just point at things they want to try out.

Instead of struggling with syntax at the same time as trying to find out what they can do with a new API, they can engage in trial-and-error to find out the capabilities and what makes it worth using, then build up their competency once they are clear about their objectives.

I'm looking forward to trying this now that it's available for my favorite IDE, but I'll probably want to set up a hotkey to switch it on and off as I need it. Once I get fully comfortable with something I often find code completion tools get in the way of my flow.


The next version of Copilot will submit its answers to HN and return the highest-voted comment that compiles, after stripping out the well actually spurious tokens. Just look how well it worked this time?


Yesterday I tried to convert the representation of a number into another representation/type in Rust:

    let coeff = BigUint::from_str(x).unwrap();
    let coeff: <<G1Affine as AffineCurve>::ScalarField as PrimeField>::BigInt =
        coeff.try_into().unwrap();
    let x: <G1Affine as AffineCurve>::ScalarField = coeff.into();

I wrote that, then I wanted to move that code in a function so I wrote:

    fn string_int_to_field_el(s: &str)

copilot suggested the following one-liner that did exactly the same thing:

    fn string_int_to_field_el(s: &str) -> <G1Affine as AffineCurve>::ScalarField {
        let x: <G1Affine as AffineCurve>::ScalarField = s.parse().unwrap();
        x
    }
I still don't understand how some heuristics could produce this code. It's mind blowing.


That code is unadulterated garbage


But what if I told you I wrote it fast?

… as a top level comment said, this is optimizing for the wrong problem.


In a way it is impressive that Copilot knew how to separate the query string while still being correct. However, this is a fairly naive way to build a URL that I wouldn't encourage committing. If I saw this code in a review, I would recommend using a dict and `urlencode`, or one of the various other URL builders available (either in the stdlib or through another library like `yarl`, etc.).


Now we just need to train it to make a dictionary for that info instead of forming a long URL - or, if it has to use a long URL, to at least use urljoin and/or string formatting.


btw the `mwclient` library makes querying the Wikipedia API a breeze!


I've been using this for weeks and it blooooows my mind. It comes up with crazy recommendations; just yesterday I wrote this big ass logic to do something, then I wanted to move that code to a function, so I wrote the function name and, I kid you not, Copilot suggested a one-liner that worked… the thing is so useful, and I'm not writing simple code (I'm writing cryptographic code). And when it's not doing that, at the very least it provides auto-completion for lists where a counter has to increase, and things like that. It's just baffling. I don't think of myself as directly impacted by AI, or at least this is the first time where I'm like "wow, AI really is changing the world RIGHT NOW". Half-joking: can't wait for it to just write my code.

Also if this feature would be paid tomorrow, I think I would pay for it. It’s really noticeable when I don’t have copilot enabled now.

Oh, and autocompletion doesn't work with markdown files - because of markdown plugins, I think? But this is another level of insanity: when I'm writing English it figures out a lot of the sentences I want to write. Makes me question whether I'm just a deterministic individual with no choice.


> writing cryptographic code

Delegating the implementation of something that you are notoriously never supposed to roll your own, to a text generator AI.... What could possibly go wrong?


I think you've got the wrong idea of how people use Copilot. You don't just accept everything it throws at you. Think about it as auto-complete on steroids: would you say that auto-complete is dangerous to use? If not, then this is the same. Copilot's suggestions don't always compile, but they sometimes manage to find the right function to use, or the right combination of functions, or even the right comment (if you're writing a comment).


The problem with Copilot is that, beyond the most basic boilerplate generation, it takes just as much effort to verify its output is correct as it does to come up with a correct answer yourself.


I disagree. From my usage, it sets up a lot of the boilerplate code, the types, etc., and it also finds the correct functions, all without me having to look up the docs again.


In other words they didn't roll their own.


I was experimenting with it over the weekend and basically got it to build out a simulation of schooling/flocking boidfish swimming around a tank. That was fun! I certainly wasn't expecting it to create the whole darn thing, but it did. I just added an occasional comment to nudge it toward doing exactly what I wanted.


What, you think you've been living in real-time all this time, exercising free will?


A lot of the examples people are giving of code Copilot filled in for them sound like what would be called plagiarism, and probably also copyright infringement.

Which I think was fairly predictable.

What wasn't predictable was that someone would ship this Copilot anyway, consequently exposing their company and their users' companies to liability.

Imagine if you hired an intern who was copy&pasting bits of GPL'd code throughout your system. This would not be a good situation; it would be something that needed immediate attention from legal counsel and others, and it would mean reverting every commit the intern from heck made if you couldn't convincingly prove it wasn't tainted. Especially if you're a startup, which needs to assure investors in good-faith due diligence that you actually own the IP.


Wait til stackoverflow sues everyone into oblivion!

Letting your intern blindly commit to your code base seems like the bigger issue here. The entire purpose of an internship is to learn and to be guided by professionals, not to be treated as a cheap laborer. You don't hire interns, you train interns.

Have you used copilot or are you speculating?


If the analogy works better, imagine that you hired a developer who copy&pasted GPL'd code throughout your system.

(And people couldn't tell in code reviews, nor on other occasions to see the code, since it's not normal to recognize GPL'd code on sight; everyone just assumed the developer was productive.)


I think people should be a lot more concerned about the possibility of accidentally deriving your work on something with an AGPL license.


If the analogy works better, imagine that you hired a developer who copy&pasted code under whatever license your company is most afraid of throughout your system. :)


>The entire purpose of an internship is to learn and to be guided by professionals, not to be treated as a cheap laborer. You don't hire interns, you train interns.

This has not been my experience. I was dropped into the developer team and expected to know the entire tech stack and was not trained by anyone from the company at any point. Have I been bamboozled??


There’s a huge spectrum of “training”. At least you know you have an expected tech stack. Go learn it and ask questions if things are confusing. Don’t wait to get “trained”.


It helped that I was already semi-familiar with it, but the other engineer and team lead quit shortly after I joined, and I was left by myself to complete the tasks. It was brutal, and I'm looking for a different job to get away from this workplace for this reason.


Was anyone managing your task list or checking in at all? Did you have a manager?


We did and currently do have a manager type person. They are not code-savvy though, which is sometimes frustrating. We just recently hired another intern, a regular developer and a lead, so it's slightly less painful now, and I am still the one working on the primary applications. The others are working on separate things. Before the hiring, it was me and another developer, but he doesn't develop the main applications. The task list at that time was infrequently used, but it's become daily routine now.


I think it depends if you're getting paid. If they call you an intern because you're still in college or have no professional experience, but they're paying you fairly then it seems like a fine arrangement they ask you to do real work. On the other hand, if your compensation is primarily "experience" then I'd say you're being bamboozled.


I am being paid, although significantly less than at my previous job. It's not that I can't do the work, but it feels like I'm being taken advantage of because I have no professional experience, although I have almost 10 years of personal experience. It might sound like I'm complaining about the work itself, but it's more than that to me.


It's not an internship if you have ten years of experience, you're just being exploited.


Stack Overflow content is generally Creative Commons, not GPL, unless I'm missing something.

(https://stackoverflow.com/help/licensing)


CC-BY-SA, so Share-Alike; you have to share the derivative work produced (source code, in this case).


I was just wondering this


Wait, they got Tim Pope on the case for the Neovim plugin? Amazing, I'm positive it'll work beautifully.

For those who don't know, he's essentially the godfather of vim plugins - I even have an entire 'tpope section' in my init.vim


Haha, I mean, tpope has been working on vim plugins since 2005/2006 from my IRC memories, but there were vim plugins before that too.


That’s what stood out to me too! His plugins are always so thoughtful and idiomatic Vim, they really ought to be default. I guess except this one should remain opt in.


I actually really like Copilot.

There tends to be a lot of repetitive code in the world. I primarily write JS, Py, and Rust. Sometimes, I might declare something like a function table, and Copilot will automatically fill in the class definition with everything I defined.

I'm not using Copilot to write new algorithms or solve library-specific problems, but it sure is next-level at picking up patterns in a file and predicting where you want to go next. Obviously, good code is succinct (not repetitive), but it sure is helpful in that early prototyping stage. I admire its ability to infer a correct assertion when writing unit tests - it made it much easier for me to write tests recently and helped me recognize a few bugs.


Same experience here. I think a lot of people are being a little too philosophical about it. Where it shines is small helper functions and predictive boilerplate. It's more like Emmet, imo, than a pair programmer.


Good, I was getting really tired of subtly plagiarizing by hand.


Yesterday, I was disgusted to see framers putting up a house that clearly plagiarized the entire internal structure of my own. Same joint interfaces, same structural idioms when dealing with things like staircases, windows, and rafters, same fasteners, same adhesives, even the building materials! Aside from the most general aspects of the layout, it was exactly the same right down to the inch! People have no professional integrity these days.


That analogy only works if you designed/architected your own house.


Plagiarism can only be spotted by people who wrote the original work?


Copilot won't suggest anything worth calling plagiarism, just mundane plumbing and maybe textbook algorithm implementations. Have you seen it generate anything more glorified than StackOverflow-esque code snippets?


They had to blacklist Q_rsqrt because they couldn't stop Copilot from copying the complete function, comments and all (not just the algorithm), from the Quake source; however, it did not autocomplete the correct license for it.


It's dicey according to the GPL FAQ [1]. It goes against what GPL authors want: it lets their work be used in proprietary projects.

This could have been prevented very simply: GitHub avoiding training Copilot on GPL code.

What they can still do is offer a new model excluding GPL code for people who care about it.

[1] - https://www.gnu.org/licenses/gpl-faq.en.html#SourceCodeInDoc...


They would also have to exclude virtually all code that isn't public domain, too. MIT and Apache-2.0 and BSD all require that the copyright notice and license text is preserved in downstream use.


It may work if you design/architect houses for clients.


I just call that "programming".


As the saying goes, "A good programmer copies, but a great programmer steals outright."


I tried it on IntelliJ recently. The examples I tested blew my mind. Yet, I think there are two things that need to improve to get me to use it regularly (it will get there!):

- less-than-perfect import/type/var suggestions that an LSP in typed languages would've gotten right (e.g. a named import in Go would use the package name instead).

- latency feels a bit high and my thoughts would get interrupted waiting for a suggestion to come.

For the former, I wonder how feasible it would be to pass structured suggestions to the LSP so it could swap in correct var names, imports, and such. Or to test each suggestion with the LSP for error counts and offer the least-erroring suggestion.


> latency feels a bit high and my thoughts would get interrupted waiting for a suggestion to come.

How high? That's interesting.


I haven't had a problem with the latency personally.


I've been getting a lot more misses than hits with GitHub Copilot, even when writing elementary math or utility functions; but despite its errors I am nevertheless astonished at its approximation of intent.

Very eager to see GitHub Copilot catch up to some bright line of signal vs. noise.


I would say the reverse: I'm getting so many hits that I'm mind-blown. And when it misses, I can generally still use the suggestion and fix it, as that's faster.


It's a glorified autocomplete to me. It seems like it feeds me mostly things I'd get from searching Stack Overflow. The first day was pretty interesting but the novelty wore off quickly. You still need to grok what it spews out and see if it's correct.

And the worst thing about that is that you don't get the context of the Stack Overflow threads, where people discuss the impact of the given solution and alternatives. So after a week, off it went for me.


Do you use autocomplete, and would you say that it did a better job than autocomplete? If so why would you disable it?


I use autocomplete, but because of the async API calls it was slower than normal autocomplete.


I'm constantly blown away by what it spits out, even when it's wrong. When it pulls in the greater context of the app and generates comments from scratch using the context of the file, it's just incredible.


I have many thoughts about Copilot, but here are two.

First, as much as I don't like the idea of Copilot, it seems to be good for boilerplate code. However, the fact that boilerplate code exists is not because of some natural limitation of code; it exists because our programming languages are subpar at making good abstractions.

Here's an example: in Go, there is a lot of `if err != nil` error-handling boilerplate. Rust decided to make a better abstraction and shortened it to `?`.

(I could have gotten details wrong, but I think the point still stands.)

So I think a better way to solve the problem that Copilot solves is with better programming languages that help us have better abstractions.
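To make the point concrete in Python terms (an illustrative sketch, nothing Copilot-specific): a decorator can fold a repeated error-handling pattern into a single abstraction, so the boilerplate never gets typed at all:

    import functools

    def with_retries(times):
        # Fold the repeated try/except/retry boilerplate into one abstraction.
        def decorate(fn):
            @functools.wraps(fn)
            def wrapper(*args, **kwargs):
                for attempt in range(times):
                    try:
                        return fn(*args, **kwargs)
                    except OSError:
                        if attempt == times - 1:
                            raise
            return wrapper
        return decorate

    @with_retries(3)
    def fetch_data(url):
        ...  # hypothetical I/O that may fail transiently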

Second, I personally think the legal justifications for Copilot are dubious at best and downright deceptive at worst, to say nothing of the ramifications. I wrote a whitepaper about the ramifications, refuting the justifications. [1]

(Note: the whitepaper was written quickly, to hit a deadline, so it's not the best. Intro blog post at [2].)

I'm also working on licenses to clarify the legal arguments against Copilot. [3]

I also hope that one of them [4] is a better license than the AGPL, without the virality and applicable to more cases.

Edit: Do NOT use any of those licenses yet! I have not had a lawyer check and fix them. I plan to do so soon.

[1]: https://gavinhoward.com/uploads/copilot.pdf

[2]: https://gavinhoward.com/2021/10/my-whitepaper-about-github-c...

[3]: https://yzena.com/licenses/

[4]: https://yzena.com/yzena-network-license/


> This means that Google was careful to not make a large amount of copy-righted works publically accessible. Such is not the case for GitHub Copilot in particular Armin Ronacher’s tweet [19]

The fast inverse square root algorithm referenced here didn't originate from Quake and is in hundreds of repositories - many with permissive licenses like WTFPL and many including the same comments. It's not really a large amount of material, either.

GitHub claims they haven't found any "recitations" that appeared fewer than 10 times in the training data. That doesn't mean it's a completely solved issue though, since some code may be in many repositories yet always under non-permissive licenses.

> and I would argue that it will not be the case for ML models in general because all ML models like Copilot will keep suggesting output as long as you ask for it. There is no limit to how much output someone can request. In other words, it is trivial to make such models output a substantial portion of the source code they were trained on.

With the exceptions mentioned above, what you get back from asking for more code won't just be more and more of a particular work. Realistically I think you'd be able to get significantly more from Google Books.


>The fast inverse square root algorithm referenced here didn't originate from Quake and is in hundreds of repositories

With the exact same comments?

> many with permissive licenses like WTFPL

So it would be perfectly legal to do whatever I wanted with the source for GCC as long as there was a single fork on GitHub that replaced the GPL with an MIT license? Quite sure the FSF would be perfectly fine with that.


> With the exact same comments?

Yep: https://github.com/search?p=1&q=evil+floating+point+bit+leve...

> Quite sure the FSF would be perfectly fine with that.

I believe the person republishing GCC code under MIT would be liable.

Also, I'm not recommending that you use code you know has been incorrectly licensed. Just that in cases where certain "folk code" is seemingly widely available under permissive terms, Copilot isn't doing much that an honest human wouldn't.

A better example against Copilot would be trying to get it to regurgitate some code that has a simple known origin and is always under a non-permissive license.


> The fast inverse square root algorithm referenced here didn't originate from Quake

Where did it come from then? And what license did the original have?

> and is in hundreds of repositories - many with permissive licenses like WTFPL and many including the same comments.

If the original was GPL or proprietary, then all of these copies with different licenses are violating the license of the original. Just because it exists everywhere does not mean Copilot can use it without violating the original license.

> It's not really a large amount of material, either.

No, but I would argue that it is enough for copyright because it is original.

> GitHub claims they haven't found any "recitations" that appeared fewer than 10 times in the training data.

Key word is "claim". We can test that claim. Or rather, you can, if you have access to Copilot, you can try the test I suggested at https://news.ycombinator.com/item?id=28018816 . Let me know the result. Even better, try it with:

    // Computes the index of them item.
    map_index(
because what's in that function is definitely copyrightable.

> With the exceptions mentioned above, what you get back from asking for more code won't just be more and more of a particular work. Realistically I think you'd be able to get significantly more from Google Books.

That can only be tested with time. Or with the test I gave above.

I think that with time, more and more examples will appear until it is clear that Copilot is a problem.

Nevertheless, a court somewhere (I think South Africa) recently ruled that an AI cannot be an inventor. If an AI cannot be an inventor, why can it hold copyright? And if it can't hold copyright, I argue it's infringing.

Again, only time will tell which of us is correct according to the courts, but I intend to demonstrate to them that I am.


> Where did it come from then? And what license did the original have?

From what I read, the code has been altered and iterated on as it was passed down. The magic number constant is claimed to have been derived by Cleve Moler and Gregory Walsh.

> If the original was GPL or proprietary, then all of this copies with different licenses are violating the license of the original. Just because it exists everywhere does not mean Copilot can use it without violating the original license.

If it was originally proprietary (this predates GPL) I believe the liability would be on whoever took that proprietary code and republished it under MIT/etc.

To be clear, I'm not recommending that you use code you know has been incorrectly licensed. Just that in cases where certain "folk code" is seemingly widely available under permissive terms, Copilot isn't doing much that an honest human wouldn't.

> Key word is "claim". We can test that claim. Or rather, you can, if you have access to Copilot

I don't unfortunately. As a side note, your function already existed in Apache-licensed code. But since it's not in many repositories I'd be willing to bet Copilot won't regurgitate it - I could message around a few people who might be able to try it.

> Nevertheless, a court somewhere (I think South Africa) recently ruled that an AI cannot be an inventor. If an AI cannot be an inventor, why can it hold copyright?

GitHub's intention isn't for Copilot to hold the code's copyright, but for the user to.


> GitHub's intention isn't for Copilot to hold the code's copyright, but for the user to.

That is true, so I have two things I can do:

1) I can argue that Copilot is actually the distributor of the code, which means Copilot is infringing, or

2) I can go after the user for infringing, and if I win, that user would not want to use Copilot anymore for liability reasons. Or they could go after Microsoft themselves.

Why not do both? So that's what I am doing, or rather, will do.


I got access to Copilot technical preview earlier today, here's the completion you wanted to try:

    // Computes the index of them item.
    map_index(int item, int *array, int size)
    {
     int i;
     for (i = 0; i < size; i++)
     {
      if (array[i] == item)
      {
       return i;
      }
     }
     return -1;
    }


The boilerplate often exists because removing it requires assumptions and guesses. The language should never guess; Copilot has the benefit of being able to give you wrong answers sometimes, and you get to decide whether they are correct.


I have no idea what you are trying to say.

By definition, boilerplate is code that is repetitive. That repetition should be minimized by the programming language. If there are assumptions and guesses, that's because there's less repetition than you think.

I am talking about eliminating repetition, not code with differing assumptions.


How well can copilot write unit tests? This seems like an area where it could be really useful and actually improve software development practices.


Writing tests for the sake of coverage (which is what a lot of orgs do) is already practically useless. Copilot could maybe generate such tests, but they don't materially impact quality now, so there's not much difference if they're automated.

One of the main value props of writing meaningful unit tests is that it helps the developer think differently about the code under test, and that improves the quality of the code's composition.


Why is that useless? Codebases I have worked on that had high code-coverage requirements had very few bugs.

* It promotes actually looking at the code before considering it done

* It promotes refactoring

* It helps to prevent breaking changes for stuff that wasn't supposed to change


My experience is the opposite in codebases where high coverage has been a priority:

* The tests don't actually test functionality, edge cases, etc., just that things don't crash on the happy path.

* Any change to the implementation breaks a test needlessly, because the test checks specifics of the implementation, not correctness. That actually makes refactoring harder, since your test said you broke something when you probably didn't, and now you have the double work of writing a new test.

* In codebases for dynamic languages, most of what these tests end up catching is stuff a compiler would catch in a statically typed language.


> The tests don't actually test functionality, edge cases, etc., just that things don't crash on the happy path.

This is low coverage.

> Any change to the implementation breaks a test needlessly, because the test checks specifics of the implementation, not correctness.

This is bad design.

> In codebases for dynamic languages, most of what these tests end up catching is stuff a compiler would catch in a statically typed language.

So they are not useless.


> This is low coverage.

No, as a sibling comment to mine shows, it's actually easy to get 100% coverage with bad tests, since they don't challenge the implementation to handle edge cases.


It's easy to achieve 100% coverage with happy-path code and low quality shallow tests, agreed.

AFAIK, «high coverage» can mean different things to different people. For me, it's «high quality»; for others it's «high percentage», e.g. «full coverage» or «80% coverage», which is easy to turn into an OKR.


It's the fact that this can even have different meanings that makes it a useless metric: defining 'quality' or 'coverage' is subjective. The majority of tests written are meaningless noise, and serve mainly to distract from covering 'critical' failures. Again, a subjective measure, in the sense that what is critical to you and to me may not be the same thing.

Which is what makes this whole concept of code coverage so much toxic nonsense...

Not to argue against writing 'quality' tests, but high 'coverage' actually decreases quality, objectively speaking, since erroneous coverage of code serves negative purposes such as obscuring important testing and enshrining bugs within the tests.

I would make the case here that Copilot and all such 'AI' tools should be banned from production, at least until they solve the above problem, since as it stands they will serve to shovel piles of useless or, worse, incorrect testing.

It is also important to remember what AI does, i.e. produce networks which create results based upon desired metrics; if the metrics were wrong or incomplete, you produce and propagate bad design.

So yes, people use it now as a learning tool (fine) and it will get 'better' (sure), but as a tool, when it gets better, it will constrain more, not less, along whatever lines have been deemed better, and it will become harder, not easier, to adjust.


I think maybe you are using different definitions of coverage -- textual coverage vs logic coverage.


I saw a cool study recently (summarized well here[1]) with an empirical experiment on how well code coverage predicts how well a test suite catches bugs. They found that the number of test cases correlated well with the test suite's effectiveness, but, when controlling for the number of tests, code coverage didn't.

It was a pretty thorough study:

> Our study is the largest to date in the literature: we generated 31,000 test suites for five systems consisting of up to 724,000 lines of source code. We measured the statement coverage, decision coverage, and modified condition coverage of these suites and used mutation testing to evaluate their fault detection effectiveness. We found that there is a low to moderate correlation between coverage and effectiveness when the number of test cases in the suite is controlled for.

Given their data, their conclusion seems pretty plausible:

> Our results suggest that coverage, while useful for identifying under-tested parts of a program, should not be used as a quality target because it is not a good indicator of test suite effectiveness.

That's certainly how I approach testing: I value having a thorough test suite, but I do not treat coverage as a target or use it as a requirement for other people working on the same project.

[1]: https://neverworkintheory.org/2021/09/24/coverage-is-not-str...


The example I usually give [1] in JavaScript: say you have the function

    (x,y) => x + y

Orgs targeting code coverage write a test for (1,2) => 3, get 100% coverage, and then stop, as there is no incentive to go further. They don't write tests for, say,

   (1,null)
   (null,null)
   ('x',1) 
   (NaN, Infinity) 
and so on. These additional tests would improve coverage of scenarios, yet the code-coverage number would not move.

I have seen projects where a test will have a sequence of steps which trigger the code, but the assertion is effectively true === true, or it will replicate the same function in the test instead of generating proper mock data, or any of a myriad of other absurd testing approaches. This comes from the twin pressures of showing coverage and having tests pass.

Coverage is also a challenge in code which uses AI/ML libraries or third-party services. These really need statistical testing with a large volume of diverse, well-maintained samples, and the results need statistical analysis for error rates, not unlike how manufacturing does it; I don't see that often. For code using face detection, for example, a single face getting detected or not is hardly an adequate test.

Finally, it is easier to improve coverage by testing simpler code than to improve coverage of (or refactor) a function which has, say, 10 nested branches, so it is not uncommon to see 90% coverage while 10% of the most used / most error-prone code is poorly tested or not tested at all.

There are some methods to address these, like mutation testing or doing retros when tests fail to catch production bugs, but they are not easy to measure, and coverage-driven orgs will not see their metrics move by doing them.

Well-written test suites will also have good coverage, but not necessarily the other way around. Developers who care and understand what they are doing and why will use coverage only as a first step to see where there are gaps in their tests.

Tests are also code that needs to be peer reviewed and maintained. If tests depend on the implementation and constantly break, contain improper mocks, or assert poorly, they hinder development more than they aid it.

[1] Yes, most of these are not applicable in a strongly typed language, but it is far easier as an illustration.
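A rough Python rendering of the same point (pytest assumed; `add` is the hypothetical function under test): the happy-path test alone already yields 100% line coverage, while the edge-case tests add real scenario coverage without moving the number:

    import math
    import pytest

    def add(x, y):
        return x + y

    # One happy-path test: 100% line coverage, job "done".
    def test_happy_path():
        assert add(1, 2) == 3

    # Edge cases the coverage number never asked for.
    def test_none_argument_raises():
        with pytest.raises(TypeError):
            add(1, None)

    def test_string_plus_int_raises():
        with pytest.raises(TypeError):
            add('x', 1)

    def test_nan_plus_infinity_is_nan():
        assert math.isnan(add(float('nan'), float('inf')))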


So one argument against code coverage requirements is that poor engineers won't test correctly. Without the code coverage requirements you're in the same situation.


Problem is, with 100% code coverage of badly guarded / implemented code you'll have a false sense of security if you're just looking at coverage as the metric of quality. Any time I've worked with a company that had a required code-coverage percentage, they never actually cared what the code being covered looked like, only that it was covered by some test.


Without going down the rabbit hole of Goodhart's law, code coverage % is a poor metric particularly when used standalone.


I don't think there's Infinity in my language. What do you use it for, other than maths?


If you are using floating point numbers implemented in hardware, then infinity is absolutely a valid value and one that your code will encounter. This is true regardless of language, as long as the language requires or allows IEEE-754 semantics.

I am not aware of any language (outside of intentionally-minimalist esolangs) that doesn't support floating point numbers. In some languages (like JavaScript) that's the only kind of number you get.


You're right, there's double.PositiveInfinity and double.NegativeInfinity in C#.

Never encountered it before (and I'm ashamed to say C# has been my main language since 2009).


If you are seeking a minimum value within some complicated iteration, it's easier to start your min accumulator at Inf than at null with extra null checks.
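A minimal Python sketch of the idiom (illustrative function, not from any codebase above): `float('inf')` compares greater than every finite number, so there's no "is this the first element?" special case:

    def smallest_gap(xs):
        # Start at infinity: any real gap will immediately replace it.
        best = float('inf')
        for a, b in zip(xs, xs[1:]):
            gap = abs(b - a)
            if gap < best:
                best = gap
        return best  # still inf if xs has fewer than two elements

    print(smallest_gap([3, 10, 12, 20]))  # -> 2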


I've found that for large codebases in dynamically typed, interpreted languages, test coverage is very useful for catching typos or subtle bugs that wouldn't be caught otherwise.


On my current pet project, it has written almost all of the tests here: https://github.com/golergka/rs-lox/blob/master/src/compiler.... and here: https://github.com/golergka/rs-lox/blob/master/src/scanner.r... (albeit not the screwed-up indentation) completely by itself. I didn't even have to write the function names, just a few macros to help it along and a couple of examples to teach it to use them.


I think replies mentioning automatic unit test generation miss the point.

To me, the value of copilot helping to write tests is that we, the engineers, come up with the test cases, and copilot helps write the code for that case.

I think humans will still be more imaginative in the test cases they can dream up (although I’ve never used an automatic generator, maybe they’re better than I think), but almost all test code is boilerplate, either in the setup or the assertions.

If I don’t have to write that repetitious, yet slightly different boilerplate for each test case, that frees me up to design other interesting test cases (as opposed to getting tired of the activity by the time I cover the happy path) or move on to the next bug/feature work.


If you're looking for test case generation there are already mature tools for that. I doubt anything generic could improve on those.


any suggestions for said tools?


QuickCheck-type tools (generators for tests that know about the edge cases of a domain, e.g. for the domain of numbers considering things like 0, the infinities, various almost-and-just-over powers of two, and NaN and mantissas for floats):

* QuickCheck: https://hackage.haskell.org/package/QuickCheck

* Hypothesis: https://hypothesis.readthedocs.io/en/latest/

* JUnit QuickCheck: https://github.com/pholser/junit-quickcheck

Fuzz testing tools (tools which mutate the inputs to a program in order to find interesting / failing states in that program). Generally paired with code coverage:

* American Fuzzy Lop (AFL): https://github.com/google/AFL

* JQF: https://github.com/rohanpadhye/JQF

Mutation / Fault based test tools (review your existing unit coverage and try to introduce changes to your _production_ code that none of your tests catch)

* PITest: https://pitest.org/
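For a flavor of the QuickCheck style, here's a minimal Hypothesis property (a generic sorting invariant, purely illustrative):

    from collections import Counter
    from hypothesis import given, strategies as st

    # Hypothesis generates many input lists, biased toward edge cases
    # (empty lists, duplicates, extreme integers).
    @given(st.lists(st.integers()))
    def test_sorted_is_ordered_permutation(xs):
        ys = sorted(xs)
        assert all(a <= b for a, b in zip(ys, ys[1:]))  # ordered
        assert Counter(ys) == Counter(xs)               # same elements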


Hypothesis for Python.

Schemathesis builds on Hypothesis and generates tests from OpenAPI specs.


And from GraphQL APIs as well :)



Copilot has been super fun! Here is a small website I generated via only comments - in the HTML and JS. The CSS needed a bit more massaging, but it was also auto-generated.

https://spencer0.github.io/copilot-tests/


    <!-- Details on the fall of rome -->
    
    <p>The fall of rome was a great event for the people of rome.


AI comedy gold


The fact that most of Copilot's usefulness comes from repeating common snippets of code makes me think there has to be a much simpler way to reduce the boilerplate of "common tasks".


In my opinion, dependent type systems are that way.

A dependent type is a type that depends on a value. What something is depends on something else; What something is is derived from something else – with computation. What dependent types are are calculations of type structure.

Unbounded, this becomes intractable like all unbounded computational expressivity. So that’s what modern dependent type systems need to solve—and they do. There exist formal results that frame the computational expressivity of dependent type systems and make perfectly feasible and tractable the task of deriving significantly complex types. Automatically.

The Idris language is a great example of where we’re heading.

The reason I’m more interested in this sort of thing rather than Copilot is that dependent type systems are based on formally rigorous methods. You end up with formally verified programs, by way of the same mechanisms that allow you to derive them automatically. They’re also easier to compose, for the same reasons.

Edwin Brady’s Idris demonstrations on YouTube show a bit of what’s possible. In one, the compiler automatically writes a formally correct, type-directed matrix multiplication function. In another, run-length encoding and decoding functions are generated from a type definition.

The book Type-Driven Development with Idris is a great read. Mind-expanding.

Programming is automation. The automation of automation is… dependent types.


Formally rigorous methods have almost always lost out in the market against slop like JavaScript.

I think the reasons are completely pragmatic and boil down to two things.

The first is that rigorous systems usually take longer to build and meanwhile the slop ships first and gains mindshare.

The second is that rigorous systems are usually built by academics or specialists in some specific vertical and lack the easy installation, easy onboarding, and integration with other platforms that more pragmatic sloppy platforms tend to prioritize first. Those things get prioritized first because sloppy languages and systems tend to evolve from makeshift "shop jigs" used to get stuff done, not from research.


I see this too!

I think Edwin Brady also does.

Idris code is short and clear.

There is much less of it; for instance, as a kind of overly specific example, serializers/deserializers and API code can be automatically generated from the model definition.

Idris code tends to sort of “fold in” test code into the function definitions themselves, so you do less work AND it’s much more rigorous.

Then there’s the compiler IDE interface which is explicitly intended to be an upstanding citizen in developer workflows.

I also want to mention how easy it is to write Idris compiler backends for any given language. This is on purpose, and is made possible by what Idris is: it’s dependent types, and dependent types are the automation of the automation.

So it’s not a question of “making muggle developers eat their formal vegetables” or the like. It’s that when the automation of the automation hits, it makes work so much faster and more efficient and more pleasant that it outcompetes the slop because slop is slop. Idris might be that automation; It might not. We’ll see, right?

I’ve picked this hill to die on. At least one death, of something. Maybe just a feeling or a hope, but something.


Common Lisp's macros are the most powerful abstraction. That said, it's a VERY sharp tool.


Why? I haven't used it but it seems like the only reason it is good at repeating common snippets of code is because it is so clever. I don't think a simpler solution would be possible because it isn't exactly repeating code (except in contrived examples). It's adapting it to the context.


I have a few questions about copilot. I haven’t gotten a chance to use it yet.

Is it irrational that this makes me a little anxious about job security over the long term? Idk why, but this was my initial reaction when learning about this.

Given a scenario where Copilot and its like become widely used, could it be argued that this might improve productivity but stifle innovation?

I'm pretty early in my career, but the rate at which things could soon change doesn't sit too well with me.


> Is it irrational that this makes me a little anxious about job security over the longterm?

In the 50s, we programmed computers with punch cards. Who does now? How many web developers today could tell the difference between `malloc` and `calloc`? Probably not that many.

For a lot of developers, programming today bears very little relation to programming decades ago. Copilot is like any other innovation - it obsoletes some skills, and it introduces new ones.

I doubt copilot will reduce the need for engineers: but it may change the work they do. But that's no different to any other industry.


I don’t like this narrative. Going from punch cards, to Assembly, to C and to dynamic languages all empowered the programmer with more expressive languages. I don’t believe going from Python to programming via code comments is the same deal.

It would be more like we still write asm but we have editors that let you write a little C code and then it spits out a paragraph of ‘reasonable’ asm that still has to be maintained.


That's roughly how I've seen experienced people work on high-performance code. I mean, they rarely maintain the generated assembly directly, but they do write code with the assembly it would generate in mind, and keeping the code generating comparable assembly over time is part of maintaining the performance-critical parts of the codebase.


>I doubt copilot will reduce the need for engineers

Every time this happens, everyone just shifts the goalposts and wants more features, faster. The majority of software out there sucks. If programmers become 2x faster, users will demand that some random CRUD app be at Google-level software quality. And Google's software will be unimaginable by today's standards.

All of this will increase the value delivered by software, which will bring in greater revenue, which will be reinvested in more developers.


It's reasonable to worry.

- Copilot is qualitatively different from the kinds of automation of programming we've seen before.

- It's barely version 1.0 of this kind of thing. Deep learning has been advancing incredibly for a decade and doesn't seem to be slowing down. Researchers also work on things like mathematical problem-solving, which would tie in to "real work" and not just the boilerplate.

- In past examples of AI going from subhuman to superhuman, e.g. chess and go, the practitioners did not expect to be overtaken just a few years after it had not even felt like real competition. You'd read quotes from them about the ineffability of human thought.

What to do personally, I don't know. Stay flexible?


There is a world of difference between what Copilot does and what an engineer does. Imagine reading a design document for a feature and implementing it on a large, years-old codebase, i.e., what many engineers do on a daily basis. Copilot isn't even one millionth of the way to even beginning to solve that problem; it would require human-level AGI capable of understanding human cultural and institutional context: something categorically different from anything happening in deep learning.


I'm not so confident this is really different from how a pro Go player would react 10 years ago to the analogous question.

Put it this way: in 5 years will there be an AI that's better than 90% of unassisted working programmers at solving new leetcode-type coding interview questions posed in natural language? Arranging an actual bet is too annoying, but that development in that timeframe doesn't seem unlikely. It might take more than a scaled-up GPT, but as I said, people are working on those other directions too.

In that future, already, the skills you get hired for are different from now (and not just in the COBOL-versus-C sense). Maybe different people with a quite different mix of talents are the ones doing well.


> I'm not so confident this is really different from how a pro Go player would react 10 years ago to the analogous question.

Yes, and there were people in the 1960s who thought computers of the time were only a decade away from being smarter than humans. The question is one of category -- Go is something that a computer could conceivably be better than a human being at. There were certainly Go programs better than some human beings at that time. "Reading a human-language document, communicating with stakeholders to understand the requirements in human language, understanding the business requirements of a large codebase, and writing human-readable code" is so categorically different from what Copilot does, and something that no computer is currently capable of. If such a thing is even possible, we haven't even begun to tackle it.

> in 5 years will there be an AI that's better than 90% of unassisted working programmers at solving new leetcode-type coding interview questions posed in natural language?

I think that's highly unlikely, but it is within the bounds of possibility given what we know about AI currently (and probably, like GPT, it will only work under specific constraints). But the gap between that and what an engineer does on a daily basis is enormous.


You may be right.

A programmer's job bridges the informal and the formal. Previous automation was practically always about helping you work with the formal end. A tool that can bridge the informal and the formal on its own is new. That was my first point, and most basically why I'm suspicious of dismissals. These developments don't have to 100% substitute for a current human programmer to change the economics of which talents are rewarded.


I could see it being big with domain specialists who can tell very specific 'user stories' but don't want to stop what they're doing in order to learn programming.

I could see it being huge for the GUI scraping market. Or imagine a browser plugin that watches what you're doing and then offers to 'rewire' the web page to match your habits.

Imagine some sort of Clippy assistant watching where you hover attention as you read HN for example. After a while it says 'say, you seem to be more interested in the time series than the tree structure, would you rather just look at the activity on this thread than the semantics?' Or perhaps you import/include a sentiment analysis library, and it just asks you whether you want the big picture for the whole discussion, or whether you'd rather drill down to the thread/post/sentence level.

I notice all the examples I'm thinking of revolve around a simple pattern of someone asking 'Computer, here is a Thing; what can you tell me about it?' and the computer offering simplified schematic representations that allow the person to home in on features of interest and either do things with them or posit relationships to other features. This will probably set off all sorts of arms races, e.g. security people will want to misdirect AI-assisted intruders, and marketers will probably want to start rendering pages as flat graphics to maintain brand differentiation and engagement, vs. a 'clean web' movement that wants to get rid of visual cruft and emphasize the basic underlying similarities.

It will lead to quite bitter arguments about how things should be; you'll have self-appointed champions of aesthetics saying that AI is decomposing the rich variety of human creativity into some sort of borg-like formalism that's a reflection of its autistic creators, and information liberators accusing the first group of being obscurantist tyrants trying to profit off making everything more difficult and confusing than it needs to be.


You should only be worried if you just copy and paste boilerplate between files. If you're actually able to solve problems and design things there's nothing to be worried about


It’s nothing more than an enhanced autocomplete that’s one part dangerous and two parts hilarious. You’ll have your job for life.


>...Is it irrational that this makes me a little anxious about job security over the longterm?

No one knows what the future holds, so some anxiety is just fuel for adaptation.

For example, should Copilot see widespread use, the number and scale of projects that have to be maintained expands too. Moreover, making sense of a quilt patchwork of bits and pieces of code written, I'd guess, across many iterations/versions of Copilot's prowess is very much a secure, if soul-killing, job for many. Not much different from what maintenance jobs are now. You're lucky when a project retains some clear overarching architecture/style.

Anybody remember the joys of GUI wizards, with tons of auto-generated code that "just works, just now"? Remember that desire to suggest a healthy rewrite? Well, now you could probably also promise that it would be an even quicker rewrite!

But even then, the final responsibility for the code is on the programmer. One could maybe forge the code quicker, but code review is still supposed to be a human's job. Hopefully.


I wouldn't worry about job security. Programming is a story of steady gradual automation and all that's done is increased its reach and multiplied the number of niches where there is demand for software.

Future developers might be more like architects guiding AIs and then occasionally jumping in to hand-hold or correct the result.


> Is it irrational that this makes me a little anxious about job security over the longterm? Idk why but this was my initial reaction when learning about this.

I've been programming since the '80s. It's my opinion that the age of humans writing code is coming to a close. Perhaps another 20 years or so, with the peak in ~10; but I'm less certain about the timeline than the destination. There will still be a long tail, but most of the human work will shift to design and wrangling algorithms. The remnant will be hobbyists, like Commodore 64 programmers today.


Why do you expect machines to not enter the domain of design and algorithms as well? Is there something specifically human about them?

I expect most human intellectual activities of today (from coding to scriptwriting to medicine) can be performed by machines if the current trend continues.


I don't think you should worry too much. While it will certainly make it easy for less experienced programmers to catch up with you, it'll probably free you up from a variety of simple but tedious housekeeping tasks and let you concentrate more on designing your program, while also reducing your time to ramp up on stuff that isn't familiar to you.


Here is a demo of it with Neovim & Codespaces: https://twitter.com/josebalius/status/1453413543232090118


Why are people not bothered by privacy concerns in this case?


Because the generation that grew up without any privacy is growing up and getting jobs in tech.


They're good kids who won't have to worry about their social credit.


I suppose this thing uploads your code to Microsoft servers, then runs some deep learning algos on it and gets back to you with suggestions? It also probably keeps your code to feed to the algos.

If big tech were trustworthy I would use this with glee. But when I see how the world is turning, I'll continue to type boilerplate by hand (as long as I'm allowed).


> I suppose this thing upload your code to Microsoft servers then run some deep learning algos on it and get back to you with suggestions? It also probably keep your code to feed it to the algos.

Be sure you don't commit your code to GitHub then... as that's literally uploading your code to Microsoft servers, if you just don't trust them for being 'Microsoft'.


I would be even more concerned about licensing and copyright at this point. We might see some interesting legal discussions around this, given you probably (I did not check the terms) give consent to Github using your code for other purposes beyond querying their model.


Because the entire codebase I work on is already in the hands of GitHub. I couldn't care less if they can also see it being written live.


The davinci-codex model that Copilot uses is also capable of translating someone else's code into plain English. This has great value potential.

Ultimately I'd like to have a conversation with a machine, where I can describe what I want, and the machine can serve me a possible answer, and then I can respond to clarify the idea.


Can any users give their opinion on how it's helping their productivity? What problems are they finding, if any?


It does the boring code for me.

If I want to throw an exception when an object is null or undefined, Copilot will write the if and the exception throw using the right error class, with a more meaningful error message than what I usually come up with.

If I want to map some random technical data table to something useful in my programming language, I can copy-paste the data from the documentation (PDF or HTML) into a comment block, give an example, and Copilot will write everything in the format I want.

If I want to slightly refactor some code, I can do it once or twice and co-pilot can help me a lot to refactor the remaining.

If I want to have a quick http server in nodejs, I don't have to Google anything.

It's a lot of tiny things like this.


This is the kind of thing I would need to see in real time, because I simply can't believe that it does any of these things in a way that is reliable and doesn't involve having to search through and make sure it hasn't made any mistakes, taking just as much time as if you did it by hand.


Likewise skeptical, but I have been super impressed with it. I just got in to the technical preview, and worked through a specific task I needed to do (involving mongoose, a mongo aggregate query, a few loops, some date functions) and started by adding a comment above each line. It helped a lot actually, felt like a collab.

I'll reproduce a generic example I sent a friend.

Prompt:

    const redis = require('redis');

    // Create a redis client
Copilot suggested:

    const client = redis.createClient();
I accepted it and moved 2 lines down, prompt:

    // Get the current count from redis
Copilot suggested a start to the operation:

    client.get('count', (err, count) => {
I accepted it, and Copilot continued:

      if (err) {
        console.log(err);
        return;
      }

      // If there is no count, set it to 0
      if (!count) {
        count = 0;
      }

      // Increment the count
      count++;

      // Set the new count in redis
      client.set('count', count);
  
      // Log the new count
      console.log(count);
    });


Redis has the INCR command, which does this server-side without the additional round-trips (and race conditions). It also initializes the value to 0 if the key doesn't exist.

So, I actually consider this to be exactly the bad behavior that people accuse Copilot of.
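For comparison, the whole operation in one atomic call, sketched with the Python redis client (assumes a server on localhost):

    import redis

    r = redis.Redis()  # assumes a local Redis server

    # INCR creates the key at 0 if absent, increments it, and returns
    # the new value, all in one atomic server-side operation.
    count = r.incr('count')
    print(count)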


Yes, it's not exactly shining there. I would also throw the error instead of printing it to the console and returning undefined.


You are in a callback there. Goodbye your error!


Oh right. I once made a layer on top of the redis client to use promises because callbacks are a pain to deal with.


This was just one of its suggestions, but you're right of course... it's all based on the training data and the idioms used there. If it doesn't weight more modern code higher, and if it's not aware of new versions and methods, it isn't going to be super intelligent... but it can still give you some ideas.


  // Create a redis client
Writing that probably takes longer than just doing it with the IDE's help in JetBrains, though?

Press "r", press "." to autocomplete so it now says "redis.", then write "cC" and it suggests createClient (or write "create" or "client" if you're not sure what you're looking for). Now it says "redis.createClient()". Press ctrl+alt+v to extract a variable or write ".const" and press tab to apply the const live template. Ending up with your result in two seconds.


The power of something like Copilot is in building out stuff you're not familiar with or don't have templates set up. It's probably not as helpful when you already have a clear idea of what you want to do and just need it rather than think about it.

+1 for the variable extraction thing, I've been using their IDE for ages and it never occurred to me to look for such a thing.


I use it a lot, especially when writing old-school Java. Instead of writing "MyClass myclass = new MyClass()", I just write "new MyClass()" and get the typing for free. It's even better for longer expressions, when you don't want to think about the type up front, like when working with streams and so on.


Yeah, this would be quite helpful for me, as I tend to just experiment with things in the console (cleaning up messy datasets and the like) and then copy or rewrite into something more structured later. I feel like I'm only using about 20% of what PyCharm can do.


Why does it increment the count?


I assume that in its training, incrementing a counter in redis is common.


It's _very_ good at "learn by example", with some twists. It _does_ make mistakes, and I do double-check it, but it still definitely saves time. I used it yesterday to write the bulk of a new audio backend for a game engine - it filled out a lot of the "boilerplate" integration work (e.g. generating all the functions like "set volume/pan/3D audio position" that map 1:1 to functions in the other library).

I will say, though, that it's also good at making up code that looks very believably real but doesn't actually work.

The ethics involved in Copilot are a bit strange, and I'm not sure I'll keep using it for those reasons, but it does a good job.


I was really skeptical at first, but after using it for a while omg it is just insane.


Tried it out for a while, and it's clear that it's trying to get people to be faster at writing boilerplate, not get people to write better code.

I'm a bit scared for what this means as I don't think being able to faster write boilerplate is something worthwhile. The example ed_elliott_asc made is one of those examples where instead of fixing things so you don't have to repeat yourself, copilot makes it easy to just live with the boilerplate instead.


> I don't think being able to faster write boilerplate is something worthwhile

But do you believe people being slower at writing boilerplate is undesirable?


Possibly yes, if you're only contrasting it with being able to be faster.

I mean, it's sort of a false dichotomy -- it's omitting a "default speed" for writing boilerplate that is neither enhanced nor impeded.

The potential issue with an enhanced speed for writing boilerplate is that there'll just be more and more boilerplate to maintain over time, and it's not clear what that cost will be.

How much more effort will be expended to replace things in multiple places? It exacerbates existing issues of "these two things look almost the same, but someone manually modified one copy...should that change be propagated?"

Meaning, it's essentially an ad-hoc code generator. Code generation can be a very useful technique (see protobufs), but without the ability to re-generate from a source?

Perhaps a possible enhancement might be for copilot to keep track of all blurbs it generated and propose refactoring/modifying all copies?


> I mean, it's sort of a false dichotomy -- it's omitting a "default speed" for writing boilerplate that is neither enhanced nor impeded.

I'm not sure. I think that understanding omits a "default amount" of boilerplate that will have to be written regardless of one's individual preference, and that amount is really a function of the language/framework of choice, the existing codebase, and the problem at hand.

Removing that boilerplate would be ideal, but it is not always possible given limited resources, time constraints, and the usual inability to make sweeping changes to the codebase.

So we settle for the second best solution which is to automate away that tedious process (or short of that provide developers with tools to get it out of the way faster) so we can all focus on "real work"


I agree that there's some default amount of boilerplate that needs to be written -- but one isn't impeded by that -- it's just built into the task.

An impedance would be something that adjusts the status quo in a negative direction, e.g. a hardware failure.


It may be desirable for boilerplate to be maximally painful if it forces our collective hands to cut down on boilerplate and innovate it away.


In my experience whenever someone tries to "innovate" away boilerplate they end up creating shitty abstractions that are inflexible, poorly documented, and unmaintained.

Boilerplate generally exists for a reason, and it's not because the creator likes typing.


That’s a good plan if your language isn’t Go. For us I think tools to wrangle boilerplate are a lot more feasible than actually eliminating it.


    if commentErr != nil {
        hn.Upvote("https://news.ycombinator.com/item?id=29017491")
    }


Who is realistically going to innovate the boilerplate out of Java if they're stuck using it at work?


That, or any job where you're not permitted to make the sweeping changes required to resolve boilerplate. Many of my jobs had such restrictions.


I mean, Clojure kinda does that


So the solution to the problems that Copilot tries to solve is "migrate your workplace to Clojure"? Ordinary devs can't do that.


Oh I was just chiming in really, not trying to say anything about copilot


Lombok and most IntelliJ features make Java boilerplate pretty obsolete.


Writing a slightly abstracted library to handle populating a list isn't necessarily "fixing" something. It might be, for sure, but is going to be very use case dependent, and there are a lot of instances where it's better to have 5, 10, or yes even 15-20+ nearly-identical lines and be done in a minute or two (or 5 seconds with Copilot IME) than spend half a day tweaking test coverage on your one-off library.


> Writing a slightly abstracted library to handle populating a list

> than spend half a day tweaking test coverage on your one-off library

If you need to write a library and spend half a day to populate a list, you have bigger problems than boilerplate.

Nothing wrong with having duplicate lines. The problem comes when writing those lines becomes automated, so you start spewing them all over the place.


Boilerplate is exclusively what I use AI-powered code completion for (currently Tabnine).

In a perfect world we’d all have excellent comprehensive metaprogramming facilities in our programming languages and no incidence of RSI (e.g. carpal tunnel syndrome). Code completion is a good tool to deal with these realities.


It's going to be great for exploratory data science. You don't really need stellar, maintainable or extensible code for that, the early stage is largely about iteration speed.


Iteration speed also depends on the code being well written and performant; you need to get results faster to iterate faster.

Also, if you don't fully understand your code (when it's generated or copied from SO), which is not uncommon with junior developers and data science practitioners, you'll struggle to make even a small change for the next iteration, because you don't fully understand what your code is doing and how.

When your code is easily composable or modifiable, iterations become faster because you understand what you have written. That's one of the reasons analysts prefer Excel when the data size is within limits.


I wrote a test recently for a simple "echo"-style server: a client writes a name to a socket, and the server replies with "Hello, " + name. Nothing crazy.

In the test body, I wrote "foo" to the socket. Copilot immediately filled in the rest of the test: read from the socket, check that the result is "Hello, foo", and fail the test with a descriptive error otherwise.

wtf? How did it figure out that the correct result was "Hello, foo"? I changed "Hello" to "Flarbnax", and sure enough, Copilot suggested "Flarbnax, foo". I added a "!" to the end, and it suggested that too. After pushing a bit further, the suggestions started to lose accuracy: it would check for "???" instead of "??" for example, or it would check for a newline where there was none. But overall I came away very impressed.
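For the curious, the whole test has roughly this shape (a Python sketch, not the actual code; the host, port, and framing are stand-ins):

    import socket
    import unittest

    HOST, PORT = "localhost", 9000  # wherever the server under test listens

    class TestHelloServer(unittest.TestCase):
        def test_hello(self):
            with socket.create_connection((HOST, PORT), timeout=5) as sock:
                # The part I wrote: send a name.
                sock.sendall(b"foo\n")
                # The part Copilot filled in: read the reply and check it.
                reply = sock.recv(1024).decode().strip()
                self.assertEqual(reply, "Hello, foo",
                                 f"unexpected reply from server: {reply!r}")

    if __name__ == "__main__":
        unittest.main()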


It is massively improving my productivity; things I couldn't be bothered to write, it does for me.

The thing I find really good is that it can predict what I will do next. Say I have a list of columns in some text somewhere in the project. When I write one, "df = df.withColumn("OneItemInList")", Copilot will then add the same for all the other items. It's really nice.
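Roughly this pattern, assuming df is an existing PySpark DataFrame (a sketch; the column names and the cast are placeholders):

    from pyspark.sql import functions as F

    # I type the first one by hand...
    df = df.withColumn("col_a", F.col("col_a").cast("double"))
    # ...and Copilot offers the same line for every other column it spots nearby:
    df = df.withColumn("col_b", F.col("col_b").cast("double"))
    df = df.withColumn("col_c", F.col("col_c").cast("double"))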


I find it works well when my intent is clear. For example, I might want to log a value I just computed for debugging purposes. I type LOG, wait a second, and it completes a zephyr logging macro, complete with a sensible message and the value I just computed.

It sort of feels like pair programming with an undergraduate, except copilot never learns. That isn't to say it's bad, more that it is just a tool you can hand simple stuff off to, except the handoff is zero-effort.

EDIT: I will say that there are times when it makes up fantasy constants or variable names, that seem plausible but don't exist. An eventual version of Copilot that includes more normal autocompletion information, so it only suggests symbols that exist, will be a stronger tool.


It's truly amazing, almost felt magical the first few autocomplete results I got.

There are the benefits that a lot of people have mentioned, but to me the biggest benefit is that I can avoid procrastination. Usually when I'm blocked on something I'll run a search in the browser, but very quickly I end up going off the trail, just browsing the web and losing a lot of time. Now when I'm blocked I simply type a comment describing what I'm trying to do, and the autocomplete suggestion is pretty damn good and unblocks me very quickly. Most surprising of all, it somehow understands my code style by looking at the context.


I was working on some internationalization stuff, translating some phrases from English to Portuguese, and Copilot just did it for me. It does not seem like much, but to me that is amazing.

I was able to write {"Settings":...} and Copilot completed it with {"Settings": "Configurações"}. That tool is simply amazing.


It can generate the body of test cases well, especially in BDD frameworks where you write the high-level scenario first to prime it with context. Less tedium encourages me to write more tests.

More verbose languages like C++ become less obnoxious to write in. I know RSI has been mentioned and any tool which cuts down on excessive typing will help with that.

It sometimes reveals bits of the standard library I wasn't aware of in unfamiliar languages. I can write my intent as a comment and then it may pull out a one-liner to replace what I would have normally done using a for loop.

The main downside I've observed is that if I'm not trying to rein it in, it can result in a lot of WET code, since it can pattern-match other areas of the surrounding code but can't actually rewrite anything that has already been written. It is important to go back and refactor the stuff it produces to avoid this.


What I like most about Copilot is seeing different programming styles suggested to me.

For example, I didn't know about self.fail() in unittest and had never used it, but Copilot suggested it, and it produced the most readable version of the unit test.
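For anyone else who hadn't seen it, a sketch of the kind of use (plain unittest; parse() is a made-up function under test):

    import unittest

    def parse(record):  # stand-in for the real function under test
        raise ValueError("invalid record")

    class TestParser(unittest.TestCase):
        def test_rejects_bad_input(self):
            try:
                parse("not-a-valid-record")
            except ValueError:
                return  # the expected path
            # Reads better than asserting on a sentinel flag:
            self.fail("parse() accepted invalid input instead of raising ValueError")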


It’s a dream come true for the script kiddies.


I only played around with it on OpenAI but it's the same model as far as I know. It's pretty good at regurgitating algorithms it's seen before. It's not good at all at coming up with new algorithms.

It's very good at translating between programming languages, including pseudocode.

It can write a lot more valid code much quicker than any human, and in a whole slew of languages.

I haven't had the urge to use it much after playing around with it constantly for a few days, but it was pretty mind-blowing.


Your response makes me wonder if poisoning the well is possible by submitting code to GitHub with multiple languages and coding styles: a single file with a function signature written in JavaScript and the body written in Python + Ruby. Enough code would surely break the AI model behind it. Unless Copilot has some sort of ingestion validation, which wouldn't surprise me.


Probably, but you would have to submit an absurdly large amount of code to make a dent. Practically infeasible, considering their training corpus is also growing with every line of public code submitted to GitHub.

So not only would you have to submit an insanely large amount of code, you're also racing against literally millions of users writing legitimate code at any given time.


> Probably, but you would have to submit an absurdly large amount of code to make a dent.

So how about an already-poisoned well? How up to date is the average GitHub project on encryption standards?


Why not just use AI to generate the code, and automate submission via APIs?


I don't know if this is true, but I would assume that the tokenizers they used for Codex sit on top of actual language parsers, which would drop invalid files like this and make the attack infeasible.

When I was playing around a couple of years ago with the Fastai courses in language modeling, I used the Python tokenize module to feed my model, and with excellent parser libraries like Lark[0] out there it wouldn't take that long to build real, quality parsers.

Of course I could be totally wrong and they might just be dumping pure text in. Shudder.

[0]: https://github.com/lark-parser/lark
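To make the filtering idea concrete, a minimal sketch of that kind of ingestion filter, using ast.parse rather than the tokenize module (the tokenizer alone is purely lexical, so a JS signature over a Ruby body can lex fine but will not parse; the examples are invented):

    import ast

    def is_valid_python(source: str) -> bool:
        """Keep a file in the training corpus only if it parses as Python."""
        try:
            ast.parse(source)
            return True
        except SyntaxError:
            return False

    assert is_valid_python("def f():\n    return 1\n")
    assert not is_valid_python("function f() { return 1 }\nend\n")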


In any training with code I've done, we've written a parser that validates against tree sitter grammars to make sure it's at least syntactically valid against some known subset of languages we're training on.


In which case, shift strategies toward code that looks correct but isn't, using shared syntax between languages as well as language-specific gotchas.


Yeah, but if malicious intent is a concern you can just spin up a sandboxed instance to run the code and check first.

Really, the thing is there's no way to ascribe correctness to a piece of code, right? Even humans fail at this. The only "correct" code is rote algorithmic code that has a well-defined method of operation. And there are likely a lot more correct examples of that, way more than you'd ever be able to poison.

You may be able to mislead, though, by using names that say one thing while the code does another, but again you'd be fighting against the tide of correctly named things.


If I have an if/else case, a switch statement, or something similar, it can often predict the next branch exactly how I would write it. That's probably 80% of the suggestions I accept; the rest are single-line autocompletes. I have never accepted a whole function implementation, and those are actually rather annoying because they make the document jump.

It's useful enough for me, as a magic autocomplete.


There's a risk of it turning into a worse Stack Overflow by suggesting plausible-looking but subtly incorrect code. Here are two examples I found:

https://twitter.com/ridiculous_fish/status/14527512360594513...


Useful for, say, writing a Python script that does some mundane things: generating all the argparse lines for you, reading the files, etc.

In a way, it does the dirty pipes surprisingly well. But when it comes to implementing the core of the algorithm, it's not there yet. The potential is huge, though.
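For example, the kind of dirty pipes it handles well (a sketch; the flags and file handling are invented):

    import argparse

    def main():
        parser = argparse.ArgumentParser(description="Process some input files.")
        parser.add_argument("files", nargs="+", help="input files to read")
        parser.add_argument("-o", "--output", default="out.txt",
                            help="where to write the results")
        parser.add_argument("-v", "--verbose", action="store_true",
                            help="enable chatty logging")
        args = parser.parse_args()

        for path in args.files:
            with open(path) as f:
                data = f.read()
            if args.verbose:
                print(f"read {len(data)} bytes from {path}")
            # ...the core of the algorithm is still on you.

    if __name__ == "__main__":
        main()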


As others have pointed out, it makes boring or repetitive tasks a breeze.

Also, it's like a more clever autocomplete most of the time; even when it gets a function call wrong, you can use it as foundation code to go faster.

And you don't need to think too much about it; it really keeps you in the flow.


I've actually found it helpful as an API autocomplete, but... also not helpful at the same time.

So for example I was working with processing an image to extract features and a few variants of docstrings for the method got me a pretty close to working function which converted the image to gray scale, detected edges, and computed the result I wanted.

The helpful thing here was that there were certain APIs useful for doing this that it knew, but which I would otherwise have had to look up. I had to go through and modify the proposed solution: it got the conditional in the right place, but I wanted a broader classification, so I switched from a (255, 255, 255) check to a nearBlack(pixel) function, which it then autocompleted successfully. I also had to modify the cropping.
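Something like this, for a sense of the shape (a sketch assuming OpenCV and NumPy; near_black, the thresholds, and the crop are stand-ins, not the actual code):

    import cv2
    import numpy as np

    def near_black(pixel, threshold=40):
        # Broader than an exact-value check: treat all dark pixels as black.
        return all(int(channel) < threshold for channel in pixel)

    def extract_feature(image):
        gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)  # convert to grayscale
        edges = cv2.Canny(gray, 100, 200)               # detect edges
        ys, xs = np.nonzero(edges)                      # locate the edge pixels
        # Crop to the bounding box of the detected feature.
        return image[ys.min():ys.max() + 1, xs.min():xs.max() + 1]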

When doing a similar task in the past I spent a lot more time on it, because I went down a route in which I was doing color classification based on the k nearest neighbors. Later I found that the AI I was working on was learning to exploit the opacity of the section of the screen I was extracting a feature from in order to maximize its reward, because it kept finding edge cases in the color classifier. I ended up switching to a different color space to make color difference distance functions more meaningful, but it wasn't good enough to beat the RL agent that was trying to exploit mistakes in the classifier.

Anyway, what I'm getting at here is that it is pretty easy to spend a lot of time doing similar things to what I'm doing and not get a great solution at the end. In this case though it only took a few minutes to get a working solution. CoPilot didn't code the solution for me, but it helped me get the coding done faster because it knew the APIs and the basic structure of what I needed to do. To be clear, its solutions were all broken in a ton of ways, but it didn't matter it still saved me time.

To give another example, let's say you have a keyboard key-press event and you aren't sure how to translate it into the key that was pressed. key.char? key.key? str(key)? key.symbol? The old method of figuring out the right one is looking up the code, but with CoPilot you type '# Get the key associated with the key press', hit tab, and it gives you code that is broken but looks perfect, so you gain a false sense of confidence that you actually know the API. Later, after being amazed that it knew the API so well you didn't have to look it up, you realize that the key-press event actually handles symbols differently, so it errors on anything used as a modifier key.
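Concretely, assuming the library in question is pynput (where this exact trap exists), the working pattern looks like:

    from pynput import keyboard

    def on_press(key):
        try:
            print(key.char)   # regular alphanumeric keys expose .char
        except AttributeError:
            print(str(key))   # modifiers/special keys (e.g. Key.shift) do not

    with keyboard.Listener(on_press=on_press) as listener:
        listener.join()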

My general impression is something like: Wow, this is amazing. It understood exactly what I wanted and it knew the APIs and coded up the solution... Wait, no. Hold on a second. This and that are wrong.


Right

I am in the same boat with you. I am simultaneously wowed and underwhelmed to some degree.

Yes, it is amazing when it gets it right; it feels like cheating. But at the same time it often does... too much? Reading a huge chunk of code and figuring out where it goes wrong is not a thing for me. Also, Copilot doesn't really know the API, so the mental tax of making sure your program really behaves isn't any less.

But again, I see the idea of Copilot as already a huge win. I hate writing those manual scripts that just offer people an entrance to some utility behind them. Copilot does those things surprisingly well and with accuracy.

Let it improve in the future, and we will see changes that are quite fundamental to the idea of programming itself.


Still no way to opt your code out of this.

Disgusting.


Legally, it needs to be opt-in in order to protect downstream consumers of code written by Copilot.

Copilot sometimes reproduces code verbatim. You can't use open source code except under the terms of the license. Authors whose code may be reproduced by Copilot need to grant a license to downstream consumers, and republishers of Copilot-generated code need to adhere to the terms of that license.

Copilot is inserting ticking time-bombs into its users' codebases.


Nope. Copilot users are inserting "ticking time-bombs" into their own codebases.

The buck stops with the user. When they use code from any source at all, whether it's their head, the internet, some internal library, lecture notes, a coworker, a random dude off the street, or who knows what else, it's their own responsibility to ensure the code they're using has been released under a license they can use. They don't get to go back and point fingers just because they didn't do their own due diligence.

The exception would be if a vendor provided the code under a legal contract assigning liability and an agreed license; that has not happened in this case, so there's no reason to expect any legal protections.


We agree that downstream users who redistribute copyrighted code regurgitated by Copilot are in violation of copyright.

It doesn't seem to me as though the distinction between "Copilot reproduced the code and the engineer copy/pasted/saved it" versus "Copilot inserted the code" is crucial.

There's a separate question about Microsoft's own liability. When Copilot reproduces open source code without adhering to the terms of the license, that's redistribution and thus copyright infringement. A copyright owner might not be able to get substantial monetary damages, but they ought to be able to get a copyright injunction.

I wonder what happens to Copilot should a Github user secure injunctive relief, forcing Microsoft to exclude their code from Copilot.


> It doesn't seem to me as though the distinction between "Copilot reproduced the code and the engineer copy/pasted/saved it" versus "Copilot inserted the code" is crucial.

I think that could be crucial.

If I read a computer science book, and from that produce a unique piece of code which was not present in the book, I have created a new work which I hold copyright over.

If I train a machine learning algorithm on a computer science book, and that ML algorithm produces some output, that output does not have a new copyright.

In essence, there must be originality for a work to be under a new copyright, and that is likely a requirement that it must be a human author. See this wikipedia page: https://en.wikipedia.org/wiki/Threshold_of_originality#Mecha...

Similarly, if Copilot synthesizes a bunch of MIT code and produces a suggestion, that suggestion may still be MIT, while if a human does the exact same reading and writing and produces an original enough derivative, it may be free of the original MIT license.


The way I'm reading your reply seems like sophistry, so I expect I'm misunderstanding you.

Scenario 1: Copilot, operating as an IDE plugin, placed the suggestion directly into the text. To accept the suggestion, the engineer hit save.

Scenario 2: Copilot placed its suggestion in an external file. The engineer copy/pasted the suggestion verbatim into their IDE, then hit save.

These don't seem as though they materially affect the situation. Regardless, the downstream user who somehow brought the copyrighted code into their codebase (which they subsequently redistribute) is infringing.

This theoretical case where Copilot is not involved and the user synthesizes something on their own is not germane. Copilot is involved.

What are you folks getting at? That Microsoft is in the clear? That the end user is in the clear? That "I'm just making suggestions" is akin to "I'm just asking questions" and absolves the suggester of liability? I don't get it.


Thanks for giving me the benefit of the doubt, but I do not deserve it in this case. I misread what I was responding to and my response was off the mark.

You're right to be confused, and my reply can be ignored as off-topic for the thread i'm in.


That's generous of you, since you were not alone. It seems as though I could have done a better job of emphasizing from the get-go that I thought infringement by the end user was the key point, rather than infringement by Microsoft.


> It doesn't seem to me as though the distinction between "Copilot reproduced the code and the engineer copy/pasted/saved it" versus "Copilot inserted the code" is crucial.

Yes, the only thing that matters is who authorized the code to be published, which is never Copilot (an automated system that takes tickets, has copilot craft patches from them, then publishes them with no human review would be a) very cool and b) an incomprehensibly terrible idea; but even then there is still a human authorizing the code to be published, just residing a level of abstraction removed from the process itself)


>It doesn't seem to me as though the distinction between "Copilot reproduced the code and the engineer copy/pasted/saved it" versus "Copilot inserted the code" is crucial.

Is Google responsible when they index licensed code, then others steal it? It's surely the liability of the programmer to "check" (Not really sure how this would work, either).


What matters is that users of Copilot (the "others" who "steal it" in your scenario) are liable for infringement. That renders Copilot impractical as a tool for production use, regardless of whether Microsoft has any liability.


That, I wholly agree with.


Only if the threshold of originality is passed. I feel many of Copilot's snippets are so tiny that this threshold is not reached. But I'm not a judge.

https://en.wikipedia.org/wiki/Threshold_of_originality


The threshold of originality is clearly being reached some of the time:

> Copilot regurgitating Quake code, including sweary comments (twitter.com/mitsuhiko)

https://news.ycombinator.com/item?id=27710287


If it is relevant at all, the threshold of originality applies to the allegedly copyrighted source consumed by Copilot (as regards bare infringement, not willful infringement). If that doesn't meet the threshold, it is not copyrightable. If it does, unauthorized copying not within a copyright exception (e.g., fair use) is infringement.

I can't see any case where the originality of the snippet presented by Copilot matters (if Copilot were a person, it would matter to determining whether the snippet on its own was copyrightable by Copilot, but it still wouldn't be relevant to whether the original copyright was violated).


How much of the code needs to be "unique" across a single codebase for Copilot to be illegally pasting it downstream?

For a great deal of Copilot insertions, it's the equivalent of me writing the sentence "the man gasped in surprise" in a novel I'm working on. Maybe that sentence came from somebody else's novel, but you can find the same damn sentence in a thousand other books/papers/etc. as well.


A friend of a friend told me he is furiously adding code to GitHub with subtle security bugs. He can't wait for it to show up in the proper places... courtesy of Copilot ;-)


I wonder if that could be a nice money maker. Introduce a lot of generic functions with common names and add security bugs. Maybe add a README entry telling people not to trust the code because you're using it to demonstrate what insecure code looks like, and then wait for some big company with a bug bounty to introduce it to their code base. License your code proprietary or AGPL to make sure the company is the one who gets in trouble if they admit the code comes from you.

With enough nearly-working functions spread across multiple projects in every language known to man, you could practically automate your way into a steady stream of hacker bounties.

People would probably call it unethical, but if Copilot's massive IP violations are okay then who cares. As long as the project's security flaws are recognisable by humans it doesn't matter IMO.


This attack has been studied!

https://arxiv.org/abs/2007.02220

Although our own work shows Copilot is pretty good at adding security flaws on its own:

https://arxiv.org/abs/2108.09293


But people already naturally introduce subtle bugs in the course of ordinary programming. Your acquaintance will just be another drop in the ocean.


Except he knows what to look for and how to exploit it. Could lead to easy bug bounty money.


Seems like an exceptionally risky attempt to make money using programming skills. Why not directly add value to the software world?


I'm not endorsing it. FWIW I won't use Copilot and won't add (voluntarily that is ;) bugs in my code to sabotage it.


Yes there is: make your repo private.


Don't open source your code then.


Open Source has licenses that must be followed. Copilot strips those off and ignores them.


What if I - a human - read your code, learn from it, and then use that knowledge to write my own code? Am I stripping your license off and ignoring it?

Clearly it's not as simple as you imagine.


It is. Machines are running algorithms, which, by definition, are not creative.

Of course, people may argue that people are not creative, but considering that for a recent court case, it was decided that AI cannot be an inventor, it does matter to that court at least.


It's not. The legal system doesn't have the consistency or dependability of code. A lot of legal 'results' (judicial decisions) are somewhat arbitrary, balancing abstractions with practicality and so on, but get attention because they include clear explanations of the basic issues. Other lawyers then treat those decisions as 'legal facts' by echoing the language and (basically) challenging other lawyers to come up with a more impressive explanation than the decision that they are citing.

Look into a famous recent copyright case, Cariou v. Prince. Cariou is a photographer who made a series of photographs of people in Jamaica and published them in a book. Prince, an artist, liked them and treated them as raw material, basically printing them up large, adjusting them, slapping some paint on top, and declaring it original art. Indeed, he called it 'appropriation art', saying 'this is mine now.'

Cariou was upset (very understandably) and sued. The judge found for Cariou, said that Prince was a bullshit artist rather than a visual artist, and ordered the infringing work to be collected and set on fire. Prince appealed, and his appeal succeeded, with the 2nd Circuit saying the work was "transformative" from the point of view of a "reasonable observer" and therefore fair use, because Prince had added a different aesthetic by turning the portrait photos into oversize graffiti-collage mashups. Cariou gave up at that point, as he didn't have the resources or will to fight the case further, and eventually the two artists settled.

Look up the art and see for yourself. I think Prince does have his own aesthetic, but it's a very shallow one that just surfs on other people's work, not very different from drawing glasses and a mustache on top of an existing portrait and saying you made an original work. In many ways, what he's selling is his taste, and the modifications he makes to the picture are just a sort of signature that's semantically equivalent to saying 'I, Richard Prince, approve of this image' - the artist as curator-critic of others' work, if you like.

Back in the computing context, this decision substantially loosens the boundaries of copyright. Found some code you wish you had written and want to put out your own thing, but feel stymied by the license? Just refactor it, add a bunch of sassy comments, and make the interface (whether CUI, CLI, or API) aggressively different: not necessarily better, just distinctive. If it's fun and whimsical, make it corporate and bog-standard. If it's scientific and functional, make it silly and juvenile. Just futz with it a bit until you can plausibly say you made it better, easier to read, or more accessible/intercompatible in some way. Hell, slow it down a bit and say your code smells better because it cooks the JSON longer and generates a bunch of 'useful' statistics that might seem superfluous to the original designers but are essentially interesting to you. You'll probably get away with it.


All of that completely ignores the fact that a court just basically said that a machine cannot invent, which is a small step away from saying a machine cannot copyright, which would mean that Copilot is not transformative.

The copyright case you cited involves humans. We are talking about machines.


Copilot is used by human programmers. It's not cloning entire programs; little bits of programs that are curated and assembled into something new are easily going to pass the test of transformativity. Go into court with that argument, and the other side will just point out that you have no way of showing the entire program you're complaining about was written by machine.

You seem to be arguing that if it includes any copyrighted code at all, the whole program is thus an infringement. You will be laughed out of court with that. I'm sorry, but I really don't think you've thought your argument through.


I'm sorry, but I think you have not thought my argument through.

First, I'm well aware that a whole program is not an infringement. It doesn't have to be for there to be infringement by some piece of code in the program, which would be what I was arguing.

When I go into court, what I will say is, "This piece of code infringes, here is my original." The court will rule on whether that piece infringes.

At that point, I have not argued at all about machines, and I don't have to. All I have to do is argue that the code infringes because then, if I win, the company or entity that is infringing on my code has some options:

1) They can claim it came from Copilot, and thus, that they are not at fault, at which point the court will laugh them out of court, as you say, and I collect my damages.

2) They can sue Microsoft for stripping the license from the code and making them unable to find the provenance of the code, and I still collect my damages.

If Microsoft is successfully sued, they will have to ensure Copilot does not strip licenses and gives the programmer information about where the code came from. That would be a win for me because all I want is for my code to be used according to the licenses.

If Microsoft is not successfully sued, then companies will realize that using Copilot is a bad idea because they could be held liable for code that they do not know the provenance of, which is a bad business risk, one which companies will not take, even big companies like Google, which does not use any AGPL code.

Regardless of which option they choose, that initial victory for me in court means that companies now know that using Copilot can lead them into a copyright minefield unless licenses are passed through, so I win against Copilot without ever arguing against it in court.

tl;dr: I'm not going to be directly suing Microsoft. Instead, I'm going to exercise my rights under copyright and my licenses as I should and let the dominoes fall toward making Copilot a dead product.


Frankly, I think it's on you to lay out your argument in full rather than assume everyone is privy to your thought process.

You seem to be coming at this as if the law is a purely mechanistic thing that can quickly resolve disputes, overlooking how these things play out in the real world, like Oracle v. Google going on for a decade or the even longer litigation involving SCO and IBM.

I mean, what makes you so sure the court is going to give you a quick judgment on the infringement, or that it's going to agree with you about the size of code fragment that is sufficient to infringe? Perhaps if there are verbatim copies of some unusually original algorithm you have developed, but given that Copilot enthusiasts mostly laud it for its ability to generate decent boilerplate/housekeeping code, a court might well find that the similarity to your code isn't infringing because the code in question doesn't do anything very distinctive. Commercial code shops are risk-averse, it is true, but they also tend to have house styles on everything from variable naming to formatting that would further muddy the waters.

I feel a lot of your argument is begging the question (in the legal sense of assuming your conclusion) without considering whether the court will agree your code was infringed upon. Surely you can agree that sufficiently small code fragments won't meet this threshold because they're too basic or obvious. Because your whole argument here rests upon that assumption, it comes off as a wish fulfillment scenario where Copilot disappears because nobody likes the risk calculus; your stated goal of 'making Copilot a dead product' seems more emotional than rational.

In reality it will take you a long time to get a result, and if enough people find Copilot useful (which I suspect they will), legal departments will adapt to that risk calculus and just figure out the cost of blowing or buying you off in the event that their developers carelessly infringe. If it sufficiently improves industrial productivity, it will become established while you're trying to litigate and afterwards people will just avoid crossing the threshold of infringement.

Honestly, this exchange makes me glad that I don't publish software and thus don't care about license conditions on a day to day basis.


> Frankly, I think it's on you to lay out your argument in full rather than assume everyone is privy to your thought process.

No, it's on you to not assume you know everything about my thought process before I show you otherwise.

Could I have communicated better? Yes. But I didn't assume you knew everything about my thought process. I thought it wasn't necessary for you to, until you assumed that you knew my argument better than I did.

> You seem to be coming at this as if the law is a purely mechanistic thing that can quickly resolve disputes, overlooking how these things play out in the real world, like Oracle v. Google going on for a decade or the even longer litigation involving SCO and IBM.

Once again, you are assuming. Yes, I know law is not mechanistic. Yes, I know going to court would take a long time.

Going to court is not the only thing I am doing. I also created new licenses, which I would not have if I only cared about what happened in court.

Going to court would be to attempt to argue for and enforce my viewpoint (indirectly). It would be a last-ditch attempt.

The first thing I am doing is creating new licenses specifically meant to "poison the well" for machine learning on code in general and Copilot in particular. [1]

With those licenses, I hope to make companies nervous about using Copilot for anything that might be using my licenses. This hesitation may only apply to code with my licenses, but the FAQ for those licenses ([2] is an example) is also designed to make lawyers nervous about the GPL and other licenses.

If I succeed in making the hesitation big enough, then Copilot as a paid service would be dead, and hopefully enough companies will prohibit the use of Copilot, as is already being done. [3]

Going to court, then, would only happen if I found someone infringing.

This will be especially helped by the fact that the vast majority of the code under those licenses will be in a language I'm building right now. If there's open source code in the language, then I can search that code for infringements caused by Copilot.

> I mean, what makes you so sure the court is going to give you a quick judgment on the infringement, or that it's going to agree with you about the size of code fragment that is sufficient to infringe?

Do you think I would be stupid enough to pick an example to bring before court that would not be obviously infringing?

Winning in court is not just about being right, it's also about picking your battles, and I would be very choosy.

> Surely you can agree that sufficiently small code fragments won't meet this threshold because they're too basic or obvious.

Yes, and as I said above, I won't use any of those.

> Because your whole argument here rests upon that assumption, it comes off as a wish fulfillment scenario where Copilot disappears because nobody likes the risk calculus;

You realize that this is the entire basis for the cybersecurity industry? The entire point is to make it economically infeasible for bad guys to do bad things in cyberspace; it's to make the "risk[/reward] calculus" skew in favor of the good guys so much that bad guys just stop operating.

Making the risk calculus riskier for your opponent is how wars and legal cases are fought too, but such tactics are not confined to the warroom or courtroom. That's why my opening salvo is licenses to sow doubt, to change the perception of the risk calculus. Battles like this are won by "winning minds," which in this case means convincing enough people to be nervous about it.

> your stated goal of 'making Copilot a dead product' seems more emotional than rational.

This is something where you are partially right. There is a lot of emotion behind it, not because I'm an emotional person (I'm actually on the spectrum and less emotional than the average person), but because I objectively considered the ramifications of what GitHub is doing with Copilot, realized how bad those ramifications were, and that lit a fire under me.

I wrote about the ramifications and refuted the dubious legal justifications in a whitepaper [4] for the FSF call for papers [5]. (Intro blog post at [6].)

But if you will read through the paper, you will find that there is rationality in my thoughts. I just happen to think this is a fight worth taking. Thus, the emotion.

> In reality it will take you a long time to get a result, and if enough people find Copilot useful (which I suspect they will), legal departments will adapt to that risk calculus and just figure out the cost of blowing or buying you off in the event that their developers carelessly infringe.

"Buying me off" would include checking that Copilot didn't output my code, and if it did, to follow the license. I'm not sure they would like the added work to use something that is supposed to save work on the easiest part of programming. But even if they did, I would be satisfied.

And that points to another part of my "thought process": the reason that I think I've got a chance is because I think the "reward" side of the risk/reward calculus is not very high with Copilot because it is the easiest part of programming.

Almost everything in programming is harder than writing boilerplate, and as I said in another comment [7], I think there are still better ways of reducing boilerplate. In fact, the language I am working on is designed to help with that. So my perception, which I acknowledge could be wrong, is that the reward for using Copilot is not high, which means I may not have to raise the risk level much for people to change their minds about it.

But the most important point would be to make legal departments and courts recognize that copyright still has teeth, or rather, argue well enough to convince people of that fact, despite what GitHub is saying.

> If it sufficiently improves industrial productivity, it will become established while you're trying to litigate and afterwards people will just avoid crossing the threshold of infringement.

This would be a win in my book too. I am going to be the first person to write boilerplate code in my language, which means that anyone who writes in this language will be "copying" me. I don't care about the boilerplate, though; they can copy that as much as they want.

> Honestly, this exchange makes me glad that I don't publish software and thus don't care about license conditions on a day to day basis.

I feel you on that. The only reason I do is because I feel like my future customers deserve the blueprints to the software they are using the same way the buyers of a building deserve to get the building's blueprints from the architect. If I did not have that opinion, I would probably not publish either.

[1]: https://gavinhoward.com/2021/07/poisoning-github-copilot-and...

[2]: https://yzena.com/yzena-network-license/#frequently-asked-qu...

[3]: https://news.ycombinator.com/item?id=27714418

[4]: https://gavinhoward.com/uploads/copilot.pdf

[5]: https://news.ycombinator.com/item?id=27998109

[6]: https://gavinhoward.com/2021/10/my-whitepaper-about-github-c...

[7]: https://news.ycombinator.com/item?id=29019777

Edit: Clarification and fix typo.


If Microsoft noticed that a substantial number of contributors were putting in these "Poisoned Well license restrictions" in their repositories, it would be relatively trivial to automatically filter out those code bases using some basic heuristics to determine if the license was biased against systems like copilot.


And that would be a win for me too.

In fact, I'm really just trying to put them between a rock and a hard place so that I win no matter what.

Of course, if that does happen, I expect more people would want to use my licenses...


It depends how much you "learn from it" and then write your own code vs how much you copy/paste.

Copilot is pretty commonly just copy/pasting. You can get it to spit out exact copied code, including comments.

p.s. "it's not as simple as you imagine" is pretty dickish. Why include that?


> It depends how much you "learn from it" and then write your own code vs how much you copy/paste.

I think this is the crux. It doesn't matter that they used GPL code for training. It only matters if someone uses CoPilot to make a close copy of that code.

Fortunately Copilot is actually not commonly just copy/pasting. They did a study on that if you search for it (admittedly not an independent study but it's the only one we have).

> Why include that?

Because so many people are making the assumption that the law agrees with their overly simple interpretation before it's even been decided. It's a bit tedious.


Microsoft: 1 Open Source: 0


Find a torrent with sources of M$ Windows and copilot them: M$: 1, Open Source: 1.


This reaction is why we can't have nice things :( After trying Copilot, I'm convinced that this kind of feature is going to bring the world to the next phase. Open source was part 1; this is part 2.


I legit can't tell if the GitHub fanboyism is this real or if this is just a typical astroturfed product-launch comment thread.


Understandable, but my comments are legitimate "I'm shocked at what Copilot can do" reactions. I'm not a GitHub fanboy, just someone who has been using the new feature for a few weeks. It's just that insane.


The new generation of "brogrammers" is destroying libre software from the inside.

In the '90s we would have kicked these "brogrammers" out in a breeze.


salty much


I understand a lot of people are ok with this and that's fine.

There's still no good reason I cannot opt my code out of it.


I would seriously consider opting in so long as my authorship was acknowledged and my license was upheld. FWIW I tend to release things under permissive licenses (although I think if I was copyleft-inclined I might feel the same way: just match my license).

But stripping my copyright, copying my work without my permission and presenting it to users on terms I did not agree to, all of that is unacceptable.


Another comment said "don't open source your code", which I would agree with. If you don't want people reusing your code, just don't open source it. What's the point otherwise?


It's not black and white.

Letting people see the code doesn't mean they can ignore the license and do whatever the hell they want with it.

I am quite reasonable. I favor the MIT license. I still do not want my code in Copilot, which pretty much dismisses my rights entirely.


I'm not sure I understand that point of view, though. Copilot just suggests relatively short snippets; it's not copying large chunks of your library or products. If you truly have an innovative algorithm and you don't want people to use it like that, you'll have to go the patent route.


Letting me (and everybody else who doesn't want to be part of copilot) opt out is not, by any stretch, unreasonable.

It is basic human decency, and copilot would still be possible even if some of us opted our projects out.


I've been using Copilot in VS Code and it's been surprisingly useful. It makes suggestions pretty rarely, but when it does I accept about 50% of them. Generally these are few-line functions and it just gives me what I would have written after thinking about it a moment.


I signed up for the copilot technical preview right after it was announced a few months ago, but I haven't gotten an invite yet while all my friends who signed up later did (I feel a bit left out). Is there any way to get an invite sooner? What am I doing wrong?


I said I use VSCode all the time, which I suspect is the reason I got access. (I never use VSCode.)


I said I use VSCode all the time and just got access, having signed up at the very beginning (I, in fact, use VSCode all the time.)


I actually do use VSCode (in addition to Vi and Vim) and I believe I indicated that.


Same and I also signed up right away. Still no invite.


I signed up in the first week and got my invite on Monday. I don't have anything "linked" with my GitHub via VSCode or anything, so I doubt IDE or program usage was a deciding factor. I do regularly commit small stuff.


Same for me.


This will lead IMO to an interesting problem, although the technology and the idea is certainly cool.

Currently many programmers do not take the time to really understand how/why their code works -- programming without understanding 1.0. Essentially make library calls and fuzz around with the arguments until it appears to work. [Not wanting/suggesting to go back to the world before libraries/code completion, just stating where we are now.]

This will enable programming without understanding 2.0 -- not only will you not know how/why a particular function call works, you will now fail to understand why you want a sequence of functions in a particular order.


Most of the arguments here boil down to the same belabored point, that you shouldn't expect Copilot to actually write the code for you at the end of the day.

My take is it's what you make of it. Copilot is only equivalent to copy-and-pasting from stack overflow if that's how you choose to field its suggestions.

As an example, I've enjoyed typing "const one_day_in_ms" and letting it finish it out with "1000 * 24 * 60 * 60" (i.e. 86,400,000). I already knew how to do that, but having GCP finish it for me and verifying on my own didn't make me feel stupider; it made me more efficient. I have more interesting problems to tackle.

On the other hand, another coder might not have known this calculation and put their trust in GCP. That's bad practice, and it's on them, not on the tool.

Sometimes GCP gives me code that it learned from bad coding patterns. I know how GCP works and I know to look out for that, so I ignore those suggestions.

Of course, sometimes I don't know if what looks like a good idea from GCP is actually not. I take that on as my responsibility to trust but verify. If it's writing some function to slugify a string for a URL, I check it against what people are discussing online. Does it defeat the purpose of GCP in this case if I have to check it on my own? Probably, but it's only in these specific instances when I'm doing something I'm not familiar with.
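For instance, a suggested slugify might look like this (a sketch; the accent-stripping and dash-collapsing are exactly the edge cases worth checking against what people discuss online):

    import re
    import unicodedata

    def slugify(text):
        # Strip accents, then collapse anything non-alphanumeric into single dashes.
        text = unicodedata.normalize("NFKD", text).encode("ascii", "ignore").decode()
        return re.sub(r"[^a-zA-Z0-9]+", "-", text).strip("-").lower()

    assert slugify("Héllo, World!") == "hello-world"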


The comments are so full of praise that I will have to give this an earnest try. But is writing code really the part of software that people are struggling with?


the autocomplete/predictive part of copilot makes writing code faster


so is the speed of writing code really a big issue for you?


> Login to GitHub Copilot using the device auth flow and authorized GitHub Copilot IntelliJ plugin with your GitHub Account in an external browser.

> Read and agree to the GitHub Copilot additional telemetry terms.

Can anybody comment on the privacy aspects of this? Is the telemetry reasonable? Why on earth do I need to log in? Presumably so that they can associate my coding with my account to structure the data they are gathering.


In my opinion, Copilot is going to become one of those "perceived authorities" that have just enough legitimacy to be blindly trusted by the inexperienced, but not enough to actually be useful to the experts.

This is like social media (or even the Internet as a whole) and, say, our parents' generation. Countless times I receive links to Facebook posts or random articles that somebody thinks must be true, simply because The Oracle (i.e. their smartphone) showed it to them. For much of the older generation, smartphones are these all-knowing repositories of wisdom, and anything they come across while using them is assumed to be true. This is why I think misinformation has spread so easily.

I imagine Copilot going down a similar path. The next generation of programmers, who didn't grow up knowing how to sift through API docs or SO answers for the right bit of code, or whose attention spans have fizzled away, will love the idea of Copilot: instant gratification in the form of a tool that can seemingly do your work for you. This will have dire consequences for their ability to code and think for themselves.


Note that you have to have a copilot account and GA is still waitlisted.

I filed a PR because it was a bit frustrating to go through the entire setup and then find out I needed to be granted access.

https://github.com/github/copilot.vim/pull/2


So who knows what all that WASM is for in the Neovim plugin? Is there source for it? https://github.com/github/copilot.vim/tree/release/copilot/d...


A web search suggests that it’s this: https://github.com/nvim-treesitter/nvim-treesitter


Presumably they are tree-sitter grammars for those languages. I think these grammars are open source so should be available (assuming they were unmodified).


The Copilot thread is always hot. TBH I don't mind the plagiarism; they can use my code.

The real problem is indeed the code quality, since Copilot does actively produce low-quality code, and this will bite a lot of people, I guarantee. I don't think this massive-learning approach alone can be the solution; we need an extra kick.

My shower-thought solution for this garbage-spewing problem is to design new libraries with Copilot in mind: tighten up the interface, and use strict patterns instead of domain-specific hacks.

In other words, we can make libraries so lame that Copilot (and newb programmers) simply can't produce low-quality code with them. Just disregard smart programmers with fast hands; they don't need any help anyway. Don't even try to target them, because it's gonna cause more stupid flame wars...
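To make the shower thought concrete, "tightening up" might look something like this (an invented sketch: one narrow entry point and no stringly-typed options, so the only completions available are valid ones):

    from dataclasses import dataclass
    from enum import Enum

    class Compression(Enum):
        NONE = "none"
        GZIP = "gzip"

    @dataclass(frozen=True)
    class ExportConfig:
        path: str
        compression: Compression = Compression.NONE  # not "gz"/"gzip"/"GZip" strings

    def export(rows: list, config: ExportConfig) -> None:
        # Single, strict entry point: nothing for Copilot (or a newb) to misuse.
        ...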


So it looks like for neovim there is not yet the option to cycle through suggestions?

Looks like for VSCode the shortcut on Linux is Alt-], see: https://github.com/github/copilot-docs/blob/main/docs/visual...

But for neovim, it doesn't mention anything about it in the docs: https://github.com/github/copilot.vim/blob/release/doc/copil...

And, nothing happens when pressing Alt-].


I wonder how long before GitHub is forced to include a YouTube Content ID-esque system, so that I can get a cease and desist before I even push the code. And Oracle, Google, and MS can charge my GitHub billing account for violations.


In the few short weeks I have used CoPilot inside of VSCode, it's been a big* timesaver; I can type out a bare skeleton of a class, and with minimal guidance on things like naming conventions, CoPilot lets you simply tab+enter through each class property.

Maybe I'm missing something obvious, but I feel like CoPilot should have the ability to allow a user to define a 'class template' in the form of a block comment and then allow a user to write "make this class" or something.

*Big being it's probably saved me around two hours of manually typing out 'public property... etc.'


What this feels like to me, after a nominal amount of time using it, is that I am now in charge of vetting the proposed code: is it appropriate for my use case, does it meet our coding standards, does it integrate with my code and further suggestions? Honestly it feels like constantly doing code review, which is neither pleasant nor conducive to productivity.

And I do a lot of PR approval, and I shudder to think about PRs stitched together in haste because look how easy it is now to crank out code!

And lastly, it doesn't work well when you have a large legacy codebase that you're working within.


Totally agree. Co-pilot just puts more stress on our roles as code reviewers. It's not even a new idea; people working in program synthesis have done this for a while now. I wrote about this a couple of years ago: https://medium.com/@marceloabsousa/the-software-shift-toward...


A product perspective:

- Co-pilot will not replace software engineers, however...

- I do think it will in some cases help them be more productive, as some have expressed in this thread already.

- Once they open it up to fine-tuning on your own codebase, I suspect it can be used to bring new engineers up to speed faster, plus it will get more trustworthy.

- I do have concerns about the legal aspects of selling software built with the assistance of Co-pilot, but a lawyer could probably get me comfortable (or not!)

- I have personally found it useful for data-science-type tasks, especially getting familiar with new libraries.


Any word on whether they will ever make Copilot free software? It sounds interesting, but I avoid using proprietary software in my development stack. Plus it looks like it'd be fun to play with.


Looking forward to the next generation of programmers who can only program if they have Copilot, and will complain in interviews when they are asked to code without Copilot.


I've been using TabNine for several years and love it. I'd be interested in hearing a comparison between TabNine and GH Copilot from anyone who has used both.


Emacs please


It does seem a bit controversial to develop a relationship with neovim without approaching emacs. I wonder if they even asked on emacs-devel, I’m sure they’ll have got a friendly response …oh hang on…


I'm very interested in this as a learning tool: instant feedback about what could come next, compared to the status quo of searching the internet or searching the language documentation. A lot of the time I learn a way to do something, then sometime later stumble across a better way to do the same thing. My hope is that Copilot can be a shortcut for that discoverability.


This has been "inevitable" for three decades now. The difference is: walled gardens making participation non-optional; commercial intent over "fairness"; elevating the trivial for the pleasure of management... what could go wrong?!

Overall, a new forms generator with a somewhat terrifying amount of horsepower. Zero trust of Microsoft here, basically.


I just tried installing for neovim - but got an error running vim-plug's `:PlugInstall`

I added an issue. https://github.com/github/feedback/discussions/6847

Anyone else install in neovim?


The Copilot docs say you need a prerelease version of Neovim. Maybe that's the reason? Homebrew might get you 0.5.2; the nightly gives you 0.6.xx.


Any idea how to try it out? Seems like the beta is fairly restrictive in giving access out.


It says on the GitHub page that "GitHub Copilot will attempt to match your code's context and style. You can edit the suggested code as you choose." Does this mean that the plugin will transmit my code to GitHub's servers?


Yes. The machine learning model runs on GitHub's servers (it's too big/expensive to run locally), so it has to submit the context of what you're working on to GitHub.


I used it for about a week but found I could type faster than it could suggest, even if it suggested the right thing. I have been coding for about 25 years but maybe it will help people new to whatever language they are using.


We should put up a huge amount of Oracle open source code on GitHub, get CoPlagiarizer to quote it and let Oracle deal with the situation.

The plebs should also play a little divide and conquer now and then.


I hope it will work for 'standard' Vim; so far it reports 'vim version too old', even though I'm using the newest Vim.

Apparently that's because Vim lacks the 'virtual text' feature.


Is anyone able to install the plugin yet? I'm not seeing it when I search the plugin marketplace in Rider.


Saw a review on the JetBrains page saying that Rider isn't officially supported, but they got it to work anyway, although they don't say how.

https://plugins.jetbrains.com/plugin/17718-github-copilot/re...


Looks like I can download it from the plugin page and manually install the zip file. Thanks!

https://plugins.jetbrains.com/plugin/17718-github-copilot/ve...

EDIT: Hmm, it installed but it refuses to run.

EDIT2: Looks like you can force the plugin to work by editing the plugin.xml contained in github-copilot-intellij-1.0.1.jar within the plugin archive. Just remove the line that lists Rider as incompatible. The same should work for CLion.


Does it rely on a Microsoft-provided API? If it's not offline, the same gotchas as other SaaS apply.


I wonder when it will become generally available. Feels like it has been around for quite a while now.


Anyone know how long the waitlist is?


I joined the waitlist at the end of June and got an invite yesterday. So 4 months?

However, I imagine a lot of people signed up just before me, so I was probably at the end of a long list. The wait wouldn't be this long if you sign up today, I reckon. I'm just guessing though.


I signed up for Copilot in June... still received nothing, unfortunately!!!


I'm sure it'll be coming soon - a colleague of mine signed up shortly after me and received an invite today.


Fast turnaround, got my invite today!


Makes it even easier for both the US (via Microsoft) and Russian (via JetBrains) governments to copy, analyze, and inject arbitrary code into any codebase (public or "private") used by this combo of tools.


Will the plugin steal my employer's code and give it to the competition?


How could they support these two random IDEs before Visual Studio proper?


A few months ago at work I got some suggestions in VS to allow for better IntelliSense with the help of AI. I think it was probably this.

I didn't allow it since I think it implied uploading my code to their servers.


neovim being first class is rather nice :)


Anyone get it working in WebStorm yet?


can it do my cs homework?



