Some of the ARM instruction set manuals seem harder to come by than others. Recently I reverse engineered one of the firmwares of the ImmersionRC Vortex 150 racing quadcopter (it uses an STM32F3 chip, so an ARM Cortex instruction set). It was surprisingly hard to find a copy of the full instruction set manual; I just assumed that kind of thing would be a quick Google away. Eventually I got there and made my changes, but it was harder to get going than I expected, and for different reasons than I was anticipating.
For documentation of the devices in a particular SoC you'll need the reference manual from the SoC manufacturer -- they of course vary in how easy it is to find those docs.
You can also compile assembly with gcc rather than as + ld.
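For example (file names are arbitrary):
gcc -o prog prog.s
gcc invokes the assembler and linker for you. Since it links in the C startup code by default, the assembly should define main; alternatively, pass -nostdlib and provide _start yourself.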
And you can output assembly from C programs
gcc -S <source file>
You can disassemble a binary to see how it actually looks. The result is much larger than your own assembly source, because the linker pulls in startup and library code.
objdump -d <binary file>
Many disassemblers will show you friendlier output than objdump. I use HT editor (packaged in Debian-based distros as ht), an open-source clone of Hiew. In HT, press F6 -> select image, and you will have an easy-to-follow disassembled view of a binary that you can edit, if you happen to know the opcodes.
It must be possible to change instruction sets when branching, in order to call or return from a function in the opposite instruction set.
There are branching instructions that can't change instruction sets for two reasons:
1. A direct branch can have more range if you can assume it's going to a 4-byte aligned address.
2. For compatibility with code written for the ARM7DI and other really old pre-Thumb processors. Since the original direct branch instructions ignored the bottom two bits of the target address, new direct branch instructions were added rather than changing the behavior of the existing ones (see the sketch below).
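A minimal sketch of what that interworking looks like, in GNU as syntax for AArch32 (the labels and register choice are mine):

        .arm                    @ ARM-state code
    arm_caller:
        push  {lr}
        ldr   r12, =thumb_func  @ the linker sets bit 0 of the address for a Thumb target
        blx   r12               @ BLX (register form) branches and switches state per bit 0
        pop   {lr}
        bx    lr                @ interworking return to whoever called us

        .thumb                  @ Thumb-state code
        .thumb_func
    thumb_func:
        bx    lr                @ BX LR returns to ARM state (bit 0 of the saved LR is clear)

A direct bl thumb_func works too in practice: modern linkers rewrite BL to BLX (or insert a veneer) when the target turns out to be in the other instruction set.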
ARM and Thumb are mostly just different encodings of the same instructions. Having the switchover done implicitly via branch instructions is convenient for interworking -- you can link an ARM library with a Thumb application and it all just works, with function addresses having the low bit set for Thumb and clear for ARM. Generated code doesn't need to care beyond making sure it uses the right interworking instructions for call and return.
This is distinct from x86 32 vs 64 bit and ARM 32 vs 64 bit: in both those cases there's really a different processor mode with extra registers and so forth, and switchover is correspondingly more involved.
I've done several embedded projects in the past 5-10 years. Mostly it's a matter of writing just enough assembly to get things running, then flipping into C whenever humanly possible, because life is short.
Talk to old video game veterans [waves]. We wrote tons of assembly because there wasn't much choice: these were largely 8-bit processors, the compilers weren't any good, and code space was constrained. But I was talking with a guy recently who said he'd written hundreds of thousands of lines of 68K assembly, and I have no idea why you would do that, because 68K C compilers, Pascal compilers, anything compilers were pretty good even back in the benighted 80s. Well, better than assembly, anyway.
Of course, once you flip into C you're still not in an environment where you have much of a runtime (I kept having to explain to a contractor why he couldn't do heap operations in an early boot phase, much less expect the results to be addressable later).
Even in a very code-space sensitive project, I started off with a tiny bit of assembly, then made everything more or less functional in C, then went back and hand-coded routines as we needed to get bytes back: http://www.dadhacker.com/blog/?p=1911
My level of fascination with an architecture can be dramatically affected by the quality of the available tooling. If there's nothing, then that's fine; green fields are great fun. But if the tooling sucks (TI and your DSP software, I'm looking at you) or is horridly expensive, then I'm usually going to look for excuses to use something else.
Since you ask: I use lots of assembly-level programming for digital audio signal processing. Some ARM instructions offer DSP-specific features like saturation and fractional arithmetic that have no equivalent in C, which justifies using assembly. Smaller ARM cores without NEON offer a kind of miniature SIMD by operating on two 16-bit or four 8-bit numbers at once, and these can speed things up as well. To take full advantage of these it is best to write some portion of the code in assembly. I usually wrap a processing sequence in a gcc-style inline assembly macro; this way I can easily compare it with a C-only macro and maintain portability, and it saves me from having to deal with the complete instruction set and calling conventions.
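A minimal sketch of that pattern, assuming GCC and a core with the DSP extension (QADD16 saturates two packed 16-bit lanes; available on ARMv6 and e.g. Cortex-M4 class parts). The function names are my own invention, not the poster's:

    #include <stdint.h>

    static inline int16_t sat16(int32_t x) {
        return x > INT16_MAX ? INT16_MAX : (x < INT16_MIN ? INT16_MIN : (int16_t)x);
    }

    /* Add two pairs of packed 16-bit samples, saturating each lane. */
    static inline uint32_t sat_add16x2(uint32_t a, uint32_t b) {
    #if defined(__arm__) && defined(__ARM_FEATURE_DSP)
        uint32_t r;
        __asm__ ("qadd16 %0, %1, %2" : "=r" (r) : "r" (a), "r" (b));
        return r;
    #else
        /* Portable C fallback for comparison and non-ARM builds. */
        int16_t lo = sat16((int32_t)(int16_t)(a & 0xFFFFu) + (int16_t)(b & 0xFFFFu));
        int16_t hi = sat16((int32_t)(int16_t)(a >> 16) + (int16_t)(b >> 16));
        return ((uint32_t)(uint16_t)hi << 16) | (uint16_t)lo;
    #endif
    }

The #else branch is exactly the "C-only macro" described above: same semantics, portable, and easy to benchmark against the asm version.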
I've also been working with DSP processors for audio processing. The only time I've used actual inline assembly was to store the stack pointer in order to measure stack usage.
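That one is only a couple of lines -- something like the following, assuming GCC on 32-bit ARM (the function name is mine):

    #include <stdint.h>

    /* Snapshot the current stack pointer. */
    static inline uint32_t read_sp(void) {
        uint32_t sp;
        __asm__ volatile ("mov %0, sp" : "=r" (sp));
        return sp;
    }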
For saturating arithmetic and other stuff we used compiler intrinsics, which freed us from handling register allocation, stack management, etc. by hand. On that processor there weren't special instructions for saturating arithmetic; a status flag was used instead, and the compiler kept track of that one too.
We did read the assembly result and tweaked C code until assembly looked like what was expected, though.
Technically, assembly is not required for measuring stack usage: just take the address of any local variable or function argument in regular C (minus the address of some local variable in main) to get an approximation of stack usage.
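A rough sketch of that trick, assuming a downward-growing stack (true on ARM). Strictly speaking, subtracting pointers to different objects is undefined behavior in C, but as a diagnostic on typical embedded toolchains it works fine; the names are mine:

    #include <stdio.h>

    static char *stack_base;            /* recorded near the bottom of the stack */

    static void deep_in_the_call_tree(void) {
        char marker;                    /* lives near the current top of stack */
        printf("approx stack used: %ld bytes\n", (long)(stack_base - &marker));
    }

    int main(void) {
        char base;
        stack_base = &base;
        deep_in_the_call_tree();
        return 0;
    }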
Like kabdib, I've done far more reading of disassembly than writing it, because for most purposes it's easier to use C as a macro assembler.
I used to work next to someone programming Tilera manycore systems in assembler, because that's a sufficiently weird architecture that you need to do that in order to see any benefit. This is probably why manycore has never really taken off.
I do have to drop little snippets of asm into code for bare-metal stuff, but generally my main use is looking at the output of the compiler to see if it and I are on the same page.
However, as we may well be about to switch our IoT device to ARM from AVR, this might be a useful primer...
I'm only a couple of pages in, but already a lot of this guide is incorrect for ARMv8-A (A64). Much is different: e.g. no Thumb mode, no directly accessible program counter, no load multiple, no PUSH/POP, a different stack pointer, etc. It looks good if you're interested in older ARM ISAs, which is probably more applicable for IoT etc.
Lol, that's because it's about ARMv6. She even points it out in Part 1 or so (not sure where, but I saw it somewhere). If you wanna cover all the differences between ARM versions, that would be a long tutorial.
I think you'll find the only reference to v6 is where the register names are explained, e.g. <= v7: r0, r1, etc. This is another thing it gets wrong for A64: registers are named x0, x1, etc., but the lower 32 bits can also be addressed as w0, w1, etc. http://infocenter.arm.com/help/index.jsp?topic=/com.arm.doc....
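For example, in A64 the register name selects the operand width:

    add   w0, w1, w2    // 32-bit add; writing w0 zeroes the upper 32 bits of x0
    add   x0, x1, x2    // 64-bit add on the full registers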
Page 1, right before the table cross-referencing ISA with ARM family: "The examples in this tutorial were created on a 32-bit ARMv6 (Raspberry Pi 1)."
Easy to miss, particularly if you are scanning through.
What do you mean by "AT&T syntax"? Neither the 68000 nor SPARC uses it, because it's an x86 thing. Do you really mean "operand order"? There's more to an assembly language syntax than just the order of the operands.
AT&T syntax is mainly about operand order, but there are also assembler directives and constructs specific to AT&T, as found in as(1) on any traditional UNIX, including illumos-based ones. That every assembler has its own syntax is nothing new; compare and contrast Master SEKA with ASM-One on the Amiga, or MASM, TASM, and nasm, for example. AT&T as(1) syntax has nothing to do with x86: any code written for SVR4 as(1) will be using AT&T syntax, irrespective of processor architecture. The ISA used with AT&T as(1) will still be that of the processor, of course, but instead of things like a0 or d0, it'll be %a0 and %d0, for example.
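To make the differences concrete, here is the same pair of instructions in both styles -- shown on x86 only because that's where most people meet AT&T syntax; as noted above, the conventions themselves (operand order, % register prefixes, $ immediates) aren't tied to any one ISA:

    # AT&T syntax (SVR4/GNU as): source first; % on registers, $ on immediates
    movl    $1, %eax
    addl    %ebx, %eax

    ; Intel syntax (nasm): destination first, no sigils
    mov     eax, 1
    add     eax, ebx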
I was very surprised when I discovered that the assembly in the Linux kernel sources for x86 was written in AT&T syntax but the assembly for ARM was not. I had always thought that AT&T syntax was supposed to be independent of the architecture.
"Architecture independent assembly language". Just say that out loud and set it sink in for a moment. Anyways, glad that there's no AT&T abomination for ARM, too.