
But it's also very obvious that general relativity can only be an effective theory that fails for very strong fields. No one I know actually thinks there are singularities; those just point to places where GR starts to significantly deviate from reality. The Navier–Stokes equations are very accurate at the macroscopic scale, but regardless of their mathematical properties, there are no actual blow-ups in the real world, because ultimately the liquid is composed of atoms. Likewise, the fields in GR must ultimately be quantized in some way, which will almost certainly remove the singularities.

The infinities in QFT are really just a hack. I mean, all those calculations are perturbative in the first place, so they can hardly be considered a "true" picture of reality.


It's rather the degree to which Python is dynamic that makes it slow. PyPy could be considered an implicit JIT compiler for Python, yet it is still far slower than Julia. The level of magic you can apply to Python objects, which the interpreter/compiler must support, is just in a different league than Julia's. I'd be interested if someone could compare it to JS.


Lisp, Smalltalk, Dylan and SELF allow for the same kind of magic.

The JIT developed for SELF is the genesis of HotSpot.

The JRuby guys have quite a good implementation making use of Graal, and Ruby is no less magical than Python.

In the end it boils down to how much the community prefers to keep on using C versus improving PyPy.

EDIT: Fixed auto-correction induced typo.


s/Gradle/Graal/?

In the case of Python it's clear that the heavy reliance on C extensions is a blessing and a curse: it has kept Python relevant in communities like science even though it isn't very fast. However, one of the lessons of Graal seems to be that such extensions can seriously hinder performance improvements, since they are opaque to JITs.

There are a few talks by Chris Seaton (e.g. https://www.youtube.com/watch?v=YLtjkP9bD_U) on the topic.


Yeah, typo due to auto-correction.


I’d expect the conditioning to be subconscious. It doesn't matter that the computer is locked down. Your brain sees a computer, which triggers a multitasking, distraction-rich mode.


The vast majority of scientists are not able to write idiomatic Fortran, let alone idiomatic C++. Scientific C++ code written without oversight from a professional C++ developer will always be horrible. Scientific Fortran code written without such oversight can sometimes be bearable. This is perhaps the main advantage of Fortran.


Eh, I'm talking mostly about the large scientific code packages that are being developed with millions of dollars in funding and large, organized teams. The people writing these sorts of codes know what they are doing, and a lot of the migration to C++ happens because they are more familiar with it and it's easier to hire skilled people.


That's a very good point.


That all depends on factoring. If running your code once means evaluating some function a hundred times, then that's no issue.


One-based indexing is not a semantic issue, it's a language-design decision you may disagree with.


Julia 0.5 introduced support for any indexing scheme you care to invent.

1-based, 0-based, 20-based.

https://docs.julialang.org/en/latest/devdocs/offset-arrays/
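
For concreteness, here is a minimal sketch using the OffsetArrays.jl package (one package implementing this; the exact API may differ slightly between versions):

    using OffsetArrays  # package providing arrays with arbitrary index ranges

    # wrap an ordinary 1-based vector so its valid indices become 0:2
    v = OffsetArray([10, 20, 30], 0:2)
    v[0]   # 10
    v[2]   # 30

    # a "20-based" vector, as mentioned above
    w = OffsetArray([1, 2, 3], 20:22)
    w[20]  # 1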


Should array indices start at 0 or 1? My compromise of 0.5 was rejected without, I thought, proper consideration.

-Stan Kelly-Bootle


That is an ugly hack, mainly because it was not in the original language design.


It is in the original language design, which is all about type genericity and the separation of implementation from interface with zero-cost abstractions. Creating an array with a different `getindex` dispatch is a great example of what Julia's type system was made to do! The standard library chose 1-based, contiguous, fixed-dimension dynamic arrays, but that's not the right choice for every problem.

As evidence of this, check out JuliaArrays (https://github.com/JuliaArrays), a whole GitHub organization devoted to the development of alternative array types, like StaticArrays (stack-allocated immutable arrays) or CatViews (non-contiguous arrays constructed from views of multiple different arrays). The nice thing about Julia is that, if packages are written to work with generic types, they can natively (and efficiently) work with these "non-standard" array types, making them easy to integrate into the scientific ecosystem.
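
To make the "different `getindex` dispatch" point concrete, here is a minimal toy sketch (nothing from a real package) of a custom array type that generic code can consume directly:

    # a toy array type: the i-th element is i^2, and nothing is actually stored
    struct Squares <: AbstractVector{Int}
        n::Int
    end

    Base.size(s::Squares) = (s.n,)
    Base.getindex(s::Squares, i::Int) = i * i

    s = Squares(5)
    s[3]     # 9, via the getindex dispatch above
    sum(s)   # 55 -- generic Base code works through size/getindex alone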


Why ugly? It looks similar to what I know in the Pascal family, where indexes can be ranges or enumerations.


It is ugly because 1) 1-based indexing and x-based indexing are treated differently; 2) x-based indexing has to use more complex syntax, which in effect discourages the use of non-1 indexing; 3) this strategy creates potential pitfalls (e.g. implementing length and size for non-1-indexed arrays). A cleaner design, I guess, would be to specify the index range at declaration, like Pascal static arrays, and use low(A):high(A) for iteration rather than 1:length(A). This, however, complicates the 1-based use cases.

Generally, I don't think there is a good way to achieve flexible indexing without causing troubles somewhere, so I don't think Julia has really solved the problem.


1) No, they are just different dispatches of getindex. 2) No, iteration is through indices(A) or eachindex(A), etc., which are the preferred way of iterating anyway. You shouldn't do 1:length(A), which is a MATLABism that works but, I would say, isn't good Julia. 3) Defining new dispatches for length and size is a pretty standard use of the language?

"Non-standard" arrays with non-standard indexing already work in lots of packages. It could be better (that's one of the things that I am advocating for), but it's not a language tooling issue whenever it's a problem, it was the developer going `::Array` and thus requiring a contiguous 1-base index array where other AbstractArrays would actually work.


@attractivechaos I don't know how to reply to your last reply, so I'll do it here. 1:length(A) is bad because it uses a standard construction for intervals of numbers, but applies it to indices. We don't want to get rid of it, because 1:5 or 0:0.2:1 is very common and necessary, but I don't see how to tell people that they should use eachindex(A) instead except through proper docs. 1:length(A) is so common in MATLAB, though, that I am sure people will carry it over, and I'll PR their libraries to fix it. I'm not sure how to fix a knowledge issue like that.

You're not understanding generic types and their relation to (1). There's only one way to access an array: getindex. That's the function called by A[i]. However, you can use an immutable to put a thin (zero-cost) wrapper over an array and define dispatches of getindex to do whatever you need it to do. So it's implicit syntactically, because the user just writes A[i], but explicit in that the user has to choose a different type. getindex is then usually inlined and compiled according to the type, making it a thin abstraction over the implementation.
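
A minimal sketch of that wrapper pattern (a made-up Reversed view, just to show the shape of it):

    # hypothetical zero-cost wrapper: same data, reversed order, no copy
    struct Reversed{T,A<:AbstractVector{T}} <: AbstractVector{T}
        parent::A
    end
    Reversed(v::AbstractVector) = Reversed{eltype(v),typeof(v)}(v)

    Base.size(r::Reversed) = size(r.parent)
    Base.getindex(r::Reversed, i::Int) = r.parent[end - i + 1]

    r = Reversed([10, 20, 30])
    r[1]  # 30 -- the user still just writes r[i]; the type picks the getindex dispatch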

There are iterators which don't have a size or length. You can write generic algorithms which require an AbstractArray which HasLength and query at compile time for things like that and throw appropriate errors (those are called traits).

There is still a lot of development to do here, but the basics like this are pretty much solved except when new users treat Julia like MATLAB, but I'm not sure how anyone could control for that.


You can click on the timestamp, and there will be a reply link there. I think reply links are hidden for a little bit of time after posting, but I'm not sure why.


This is an anti-flame war feature, designed to let people cool off before replying (fast paced discussions were often contentious before that was introduced).


I guess so. This is a well-thought-out feature. I like it.


On 2), if you think 1:length(A) is bad, why not forbid it from the beginning (e.g. use low(A):high(A) instead)? To find an x-based array's length, why not just length(A) instead of length(linearindices(A))? Decisions like these are remedies for an immature early design. Also, what happens if I use length() on an x-based array? An abort, or a wrong number silently? On 1), having two different ways to access an array, the most fundamental data type, is already worrying enough. On 3), the page says "don't implement size or length". That is very uncommon in most other mainstream languages.

Julia has potential to become a great general-purpose programming language, but this indexing issue will practically limit it to the numerical computing community. Perhaps achieving that is already good enough.


The `length(linearindices(A))` and "don't implement `size`" recommendations are for a short transitional period while packages are ported to the framework that doesn't rely on any particular index base.


One of the most mind-boggling things about the recurrent 0- vs 1-based indexing discussion is how incredibly rarely that difference is ever used programmatically in Julia. Most of the large Julia packages are written in a way that doesn't care whether the array is 0- or 1-based. It is important in some other languages, and I think people are just happy that this is something everybody can agree to disagree on. I don't think the discussion is very productive, though.


Hmm... I wanted to learn more about 0-indexed arrays, but after looking around, I still have not figured out how to declare 0-indexed arrays without extending the AbstractArray interface or using another package.


I don't have any particular feelings toward one or the other (it is a convention, get over it), but I think that zero-based indexing is just an artifact of C that stuck around.

In C, the array syntax is "mostly" just syntactic sugar for pointer arithmetic.

When you do "a[n]=value;" this is equivalent to "*(a+n) = value;". To get the nth cell of an array, you just add "n" to your base pointer "a".

Array indexing, therefore, is consistent with the pointer arithmetic.

That said, and funnily enough, Fortran, which is much older than C, has 1-based indexing (by default, but you can configure 0-based indexing, if I remember correctly).


Zero based indexing is not a C artifact. Here's Dijkstra writing about it in '82:

https://www.cs.utexas.edu/users/EWD/transcriptions/EWD08xx/E...


Note how, in the PDF version [0], Dijkstra numbers the pages starting on zero (handwriting, upper right corner), but whoever created the PDF disregarded its message and did numbering starting on one (lower right corner). :-)

[0] https://www.cs.utexas.edu/users/EWD/ewd08xx/EWD831.PDF

Edit: Fixed link. Hacker News apparently doesn't understand the delimiting of a URL with "<" and ">". >-( https://tools.ietf.org/html/rfc3986#appendix-C


There's the very real possibility that zero based indexing is in fact a Yacht Racing artifact.

http://exple.tive.org/blarg/2013/10/22/citation-needed/


> The social reason is that we had to save every cycle we could, because if the job didn’t finish fast it might not finish at all and you never know when you’re getting bumped off the hardware because the President of IBM just called and fuck your thesis, it’s yacht-racing time.

I don't buy it. Wouldn't people want their programs to run fast regardless of this?


In C, array subscripts are offsets from the pointer, not indices; that is why they start at 0.


Which happens to be common across many languages outside the C universe.

Some languages, like the Algol ones, even have user defined indexing.

So it is neither 0 nor 1, but rather whatever the minimum value of the index happens to be.


Like every other language-design decision?


Correct, but git doesn't recompute the hashes locally, so it wouldn't know they are wrong.
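
For context, a git object id is just SHA-1 over a short header plus the content, so recomputing it is cheap; git simply skips the check on most local operations. A rough sketch of the computation (in Julia, assuming the SHA.jl package; compare against the real `git hash-object` to be sure):

    using SHA  # assumes the SHA.jl package for sha1()

    # a git object id is sha1 over "<type> <content length in bytes>\0<content>"
    function git_object_id(objtype::AbstractString, content::AbstractString)
        payload = string(objtype, ' ', ncodeunits(content), '\0', content)
        return bytes2hex(sha1(payload))
    end

    # should agree with: printf 'hello' | git hash-object --stdin
    git_object_id("blob", "hello")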


Ah, so if I were to manually craft a commit in a text editor in the format:

    tree sha1
    parent sha1 of parent I want to attach it to
    author some string
    committer some string

    The commit message
I could add this to the git object store manually under the same sha1 file and a client could just fetch it? Would the client try to fetch the faked objects when it already has the real objects in its copy of the object store?

That is, would it think it has the commit because the sha1 hasn't changed, but the tree sha1 has been updated and it would presumably refer to blobs that the client doesn't already have and try to fetch them. Or would it not proceed because it already has the commit?


It doesn't seem to verify hashes of objects on checkout, but it does when receiving packfiles. So it's difficult to see how this could be an exploit unless the attacker has access to your local .git directory.


If you're syncing the repository itself (e.g. over Dropbox) instead of using git remotes, then it could be exploited.


Why the hell would you do that? That defeats the point of git.


Because anything that can be misused will be.

I’m sure there’s a law with someone’s name on it that states that. But just in case it hasn’t been claimed yet, I’m proposing that we call it the fuck-you law. Because the next time someone comes to me to ask me to fix the Trello-to-Zapier-to-email-to-Google-Sheets setup they use as a project management tool, I want to be able to say, “Fuck you, and there’s a law that says so.”


No it doesn't. I have many of my git repos in Dropbox, but I'm not using Dropbox for sharing. Having them in Dropbox means I get automatic backup and that they are available when I switch to a different computer, which I do, but not frequently. As only I use my Dropbox account, I'm aware of the potential sync conflicts, but they've never been a problem. I do run fsck & gc more frequently than most, but I probably don't need to.

EDIT: I should emphasize that this model is way more convenient than manually having to remember to push and pull all the time. Now push is only for publishing outside as it should be.


If you're doing this then there's no reason to use git. Just sync a raw directory.


No, I use git to track my development history and I push to GitHub. These are two different issues.


They actually use KaTeX [1], which is much faster than MathJax and supports server-side rendering, but covers a smaller subset of LaTeX than MathJax does.

[1] https://khan.github.io/KaTeX/


> Perhaps not the black holes doesn't emit light directly, but the surrounding gas will be extremely hot.

Yes, but not every black hole is surrounded by gas.

> the virtual particles should be very confused

In general, physics does not work like this.


I've always told my parents, who grew up in communist Czechoslovakia, that Chinese communism is a very different beast from the Eastern Bloc communism they knew (which Vaclav Havel described masterfully in his texts). But this article would feel very familiar to them.

