bitset Find_first and Find_next

→ Обратите внимание

До соревнования
Rayan Programming Contest 2024 - Selection (Codeforces Round, Div. 1 + Div. 2)
4 дня
Зарегистрироваться »

*есть доп. регистрация

→ Лидеры (рейтинг)

№	Пользователь	Рейтинг
1	tourist	3993
2	jiangly	3743
3	orzdevinwang	3707
4	Radewoosh	3627
5	jqdai0815	3620
6	Benq	3564
7	Kevin114514	3443
8	ksun48	3434
9	Rewinding	3397
10	Um_nik	3396

Страны | Города | Организации

Всё →

→ Лидеры (вклад)

№	Пользователь	Вклад
1	cry	167
2	Um_nik	163
3	maomao90	162
3	atcoder_official	162
5	adamant	159
6	-is-this-fft-	158
7	awoo	155
8	TheScrasse	154
9	Dominater069	153
10	nor	152

Всё →

→ Найти пользователя

→ Прямой эфир

Детальнее →

Блог пользователя Arpa

bitset Find_first and Find_next

Автор Arpa, 9 лет назад, По-английски

Hi !

Today while solving 356D - Bags and Coins I needed a function for bitset in order see what is the first set bit.I asked M.Mahdi and he told me about bs._Find_first(). for example:

bitset<17>BS;
BS[1] = BS[7] = 1;
cout<<BS._Find_first()<<endl; // prints 1

After more research , we found bs._Find_next(idx). This function returns first set bit after index idx.for example:

bitset<17>BS;
BS[1] = BS[7] = 1;
cout<<BS._Find_next(1)<<','<<BS._Find_next(3)<<endl; // prints 7,7

So this code will print all of the set bits of BS:

for(int i=BS._Find_first();i< BS.size();i = BS._Find_next(i))
    cout<<i<<endl;

Note that there isn't any set bit after idx, BS._Find_next(idx) will return BS.size(); same as calling BS._Find_first() when bitset is clear;

UPD One question, bitset is 32 or 64 times faster than bool array?

bitset, c++

Arpa
9 лет назад
32

Комментарии (29)

Показать архивные | Написать комментарий?

NSV

9 лет назад, # |

← Rev. 3 →

Really nice, thanks! I just used something like this:

const int T = 8;
int index[1 << T];

void go() {
  uint8_t *data = (uint8_t*)(&st);
  const int n = st.size() / T;
  for (int i = 0; i < n; ++i) {
    uint8_t cur = data[i];
    while (cur) {
      uint8_t lower = cur & -cur;
      cur ^= lower;
      int idx = index[lower] + i * T;
      cout << idx << " ";
    }
  }
}

void init() {
  for (int i = 0; i < T; ++i) {
    index[1 << i] = i;
  }
}

But your code is much better. Unfortunately it works only with GNU C++ compiler.

Does anybody know how to find lower bit index with O(1) for uint32_t or uint64_t?

UPD It's easy if divide 32-bits number into 4 parts and use precalc.

→ Ответить

Kaban-5

9 лет назад, # ^ |

+10

__builtin_ctz and __builtin_ctzll count amount of trailing zeros (or lowest set bit, which is the same) in int / unsigned int and long long / unsigned long long integers respectively (but for some reason, when argument is zero their return value is undefined). They are claimed to be O(1) and actually work rather fast (even precalc is not much faster then them, if faster at all), unlike __builtin_popcount, which is slightly slower than accurate handwritten $\text{[math]}$ and much slower than O(1) with precalc (at least from my experience).

→ Ответить

NSV

9 лет назад, # ^ |

Great! One more advantage to use GNU C++ compiler :)

→ Ответить

Anushi1998

6 лет назад, # |

← Rev. 2 →

Is there any documentation for this _Find_next? What if I wanted to know the position of MSB in bitset? (without converting to string and then reversing it)

→ Ответить

Arpa

6 лет назад, # ^ |

What it MSB?

→ Ответить

vntshh

6 лет назад, # |

What is the time required for_Find_first? Is it O(n) or O(1) or something like O(n / 32) ?

→ Ответить

Arpa

6 лет назад, # ^ |

O(n / 32).

→ Ответить

ikovrigin

6 лет назад, # ^ |

-8

Generally speaking Find_first uses 64 bit register. But speed will be near the same with 32 bit. So writing speed in big O/theta notation in this case is not quite right.

→ Ответить

Arpa

6 лет назад, # ^ |

Yes, you're right, but using this notation is popular.

→ Ответить

ikovrigin

6 лет назад, # ^ |

← Rev. 2 →

-17

$\text{[math]}$ P.S. For those who downvoted. Even if you don't like math it will not changed. :)

→ Ответить

Swistakk

6 лет назад, # ^ |

← Rev. 3 →

+22

You're right and wrong at the same time. In more theoretical setting $\text{[math]}$ is just a shorthand for $\text{[math]}$ because that's what we get in most popular model of computation we use everyday where we are allowed to perform constant time operations on integers of size (==length of binary representation) up to logarithm of memory size. You wouldn't want adding two integers to take $\text{[math]}$ time, right?

In more practical setting, we can express it as $\text{[math]}$ where w is a word size of system we are working on. But maybe in a parallel universe there are computers with word sizes of 1048576 :)? It shouldn't be simplified to O(n), since that w is clearly a variable, right? Fact that in all computers we have w is either 32 or 64 is another matter.

By the way, note the bridge between theoretical and practical setting — why is w 32 on older computer and 64 on newer ones? Because that is how many we need in order to address memory we want (lowest power of two to be more precise), so in fact w is really a logarithm of memory size we have access to.

→ Ответить

ikovrigin

6 лет назад, # ^ |

← Rev. 4 →

-16

I know i will get minuses again but i don't care. Even if ur a Legend you didn't right. In this case it doesn't matter what the word size of our computational model. Let w is computer word size and it is constant for computer. O(n/w) = O(n) and not equal to n/log n. Word will be the same and will be constant for computer it didn't depend on n.

In practice O can be used as a coefficient when we want to know how slow will be our algo if we multiply input by coefficient n. Let s is input data size and T(s) is execution time. And now we know T(s) for some s (is big). Now lets multiply s by n so T(s*n) = T(s)*O(n) this is how O notation works.

P.S. We didn't say about adding integers it is a bitset. So all i am saying above is about bitset operation. When we say about arithmetic on integers i will agree with you.

→ Ответить

MrDindows

6 лет назад, # ^ |

But as Swistakk wrote, theoretically w is not a constant. You may assume that your algo can be run on different machines with different architectures.

Formally speaking about math, with O notation for algorithm we estimate not an execution time for fixed computer, but the number of primitive operations, which will be different for machines with different w.

Formally speaking about practice, O(n/32) algorithms are usually much faster than O(n), so we write O(n) and O(n/32) to separate them.

→ Ответить

ikovrigin

6 лет назад, # ^ |

w is not depend on n so O(n/log n) is not correct notation in any computation model (or any other sense) and is not the same O(n/w).

→ Ответить

riadwaw

6 лет назад, # ^ |

← Rev. 2 →

well, it kinda isn't and kinda is because in RAM-model you can do in O(1) operations on words you use to access memory. And you have n ≤ M ≤ 2^w because if n > M you can't get such input and if M > 2^w you can't access all of your memory. So you have $\text{[math]}$

→ Ответить

ikovrigin

6 лет назад, # ^ |

← Rev. 2 →

-17

Ahh i see Legends and Grandmasters are joking me. Ok :). Let's continue. Memory size is not always lower than 2^w. See x86 XMS and EMS specifications.

→ Ответить

riadwaw

6 лет назад, # ^ |

+16

We are speaking about models, not real PCs. On real PCs everything is O(1).

→ Ответить

ikovrigin

6 лет назад, # ^ |

← Rev. 3 →

-8

With all respect to Swistakk, MrDindows and riadwaw but RAM-machine computational model has unlimited input and output it has limited number of Registers and it is not the same. for example .... So i still think you are wrong. You can not limit n with 2^w.

P.S. For limited in/out you will always have complexity like in "real PCs everything is O(1)" (c) riadwaw

→ Ответить

q234rty

6 лет назад, # ^ |

"Because the model assumes that the word size matches the problem size, that is, for a problem of size n, $\text{[math]}$ "

Quoted From This Wikipedia Article

→ Ответить

allllekssssa

6 лет назад, # |

← Rev. 2 →

+10

I think this is the best blog for this kind of questions.

First question was already asked, how to find most significant bit in the bitset?

Second question, is there any way for easy manipulation with ranges in bitset — something like set all values in range (l, r) or flip all values in range (l, r)?

All this things can be done easy with own implementation of bitset, but is there any way to do this with std: : bitset.

→ Ответить

CountZero

6 лет назад, # ^ |

← Rev. 4 →

bitset supports bitwise operators http://www.cplusplus.com/reference/bitset/bitset/operators/

so you can prepare a "mask bitset" (i.e. 000...001111...111000..000) and use it.

bitset<sz> mask(int from, int to) { // to > from 
  bitset<sz> a, b;
  a.flip();
  a <<= from;
  b.flip();
  b <<= to;
  b.flip();
  return a & b;
}

→ Ответить

riadwaw

6 лет назад, # ^ |

But it will work in O(n), not in O(r-l) as it could

→ Ответить

allllekssssa

6 лет назад, # ^ |

← Rev. 2 →

It is good thing, you wrote this code! Now I can ask one more questions :)

What is complexity of creating one bitset of size n? Is it O(n), or $\text{[math]}$ ? I do not see reason why it would be O(n), but from other side I know it has some other constructors with binary strings, which must be O(n).

→ Ответить

MrDindows

6 лет назад, # ^ |

← Rev. 2 →

You can do reinterpret cast of std::bitset pointer to the int32* and work with it like with your own implementation of bitset. This is what I usually do when I need some specific function for the std::bitset, that it does not have.

→ Ответить

allllekssssa

6 лет назад, # ^ |

Thanks, it is not bad option too!

→ Ответить

Swistakk

6 лет назад, # |

I don't remember it in details, but bitset in fact has a function for k-th bit, however it is declared as private... I have no idea why would someone not expose such useful function to world and deem it as private, but #define private public is there to help you. It really works. You need to look up implementation and figure out by yourself how is it called, maybe do some other trick etc. However I have to admit I was convinced that what I mentioned is the simplest way of getting first alive bit which is a very usual concern. Good to know it is as simple as that.

→ Ответить

CodingKnight

6 лет назад, # ^ |

← Rev. 8 →

Other undocumented member functions of bitset<Number_of_Bits>:

bitset<Number_of_Bits>& _Unchecked_set(size_t bit_position)
bitset<Number_of_Bits>& _Unchecked_reset(size_t bit_position)
bitset<Number_of_Bits>& _Unchecked_flip(size_t bit_position)

Any of these functions does the same as its equivalent

bitset<Number_of_Bits>&   set(size_t bit_position)
bitset<Number_of_Bits>& reset(size_t bit_position)
bitset<Number_of_Bits>&  flip(size_t bit_position)

The only difference is that the Unchecked version assumes that bit_position is between 0 and Number_of_Bits-\1. Therefore, no range checking to validate its value is performed. This can speed up repetitive processing of large-size bitset objects when the function argument is guaranteed to be valid before calling any of the former functions.

Note that these undocumented functions are public members of the class. Therefore, the #define private public trick is not required to use any of them.

→ Ответить

Nots0fast

4 года назад, # ^ |

← Rev. 2 →

I couldnt find this, could you provide more details on this ?

→ Ответить

Swistakk

4 года назад, # ^ |

Me — not really, I never used it. Maybe tribute_to_Ukraine_2022 can?

→ Ответить

Соревнования по программированию 2.0

Время на сервере: 26.11.2024 06:14:45 (f1).

Десктопная версия, переключиться на мобильную.

При поддержке