This paper proposes improvements to the binary grey-wolf optimizer (BGWO) solve feature selection (FS) problem associated with high data dimensionality, irrelevant, noisy, and redundant that will then allow machine learning algorithms attain better classification/clustering accuracy in less training time.We propose three variants of BGWO addition standard variant, applying different transfer fu...